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Foreword 


I begin my first book with words of gratitude and deep respect for my 
father, Mikhail Ivanovich Zolotarev, whose whole life has been connected 
with the Soviet Army from the moment of its formation. This monograph 
is dedicated to him. 

More than 50 years have passed since the appearance of the concept of 
a stable distribution in Paul Lévy’s 1925 book Calcul des probabilités. Our 
knowledge about the properties of these remarkable probability laws has by 
now become so much richer that it could fill several monographs. However, 
no monograph dealing specifically with stable laws has yet appeared. There 
are numerous and diverse results relating to stable laws scattered in journal 
articles or, at best, appearing as auxiliary sections or chapters in books on 
other branches of probability theory. For example, information about limit 
theorems for sums of independent random variables when the limit distribu- 
tions are stable laws can be found in the well-known monographs of Gnedenko 
and Kolmogorov [26], Feller [22], Ibragimov and Linnik [35], and Petrov [65]. 
The main part of the results about characterizing stable laws is in the book 
of Kagin, Linnik, and Rao [38]. 

A number of facts reflecting the analytic properties of stable laws are con- 
tained in the Feller and Ibragimov-Linnik monographs, the book of Lukacs 
[54], and the well-known survey article of Holt and Crow [31]. Some properties 
of homogeneous stable processes with independent increments are included in 
Skorokhod’s book [77]. 

Nevertheless, this information about stable laws is of a nonsystematic, frag- 
mentary nature and does not permit one to form a sufficiently complete picture 
of the contemporary level of knowledge in any particular direction. This is 
apparently explained and to a certain extent justified by the fact that, with 
rare exceptions, stable laws did not find applications for a long time. How- 
ever. the situation changed in the 1960s after the appearance of a series of 
papers by Mandelbrot and his successors, who sketched the use of stable laws 


vil 


viii FOREWORD 


in certain economic models. And there is now a basis for believing that the 
role of these laws in the areas of economics, sociology, and biology where the 
Zipf-Pareto distribution appears will grow in the future. For that to happen, 
of course, a systematization and more thorough exposition of the known facts 
are needed. 

By way of a preliminary classification, these facts can, in our view, be 
divided into the following four groups. 

1. Limit theorems for sums of independent (or dependent in a special way) 
random variables, along with various refinements of them such as estimates 
of the rate of convergence to limiting stable distributions in diverse metrics, 
asymptotic expansions, large deviations, etc., as well as properties of stable 
processes with independent increments. 

2. Characterization of stable distributions. 

3. Analytic properties of stable distributions. 

4. Statistical problems associated with stable distributions (estimators of 
the parameters determining these distributions, problems involving hypothesis 
testing, and so on). 

By now a fairly large number of results have accumulated in the first three 
groups, and they permit us to speak of the need for a systematic presentation 
of them. 

But the realization of this program would require a volume three or four 
times the size of the present book, and that exceeds my scope by far. This 
circumstance prompted me to undertake the more modest task of selecting 
only one of the groups of results for systematic exposition. 

The choice of the third group as primary had to do with two considerations. 
First, the material relevant here was already sufficiently established, and the 
influx of new facts is not great at present, which certainly cannot be said of, 
for example, the first group. Second, I hope that the systematic exposition of 
the analytic properties of one-dimensional stable laws will stimulate analogous 
investigations for multidimensional stable laws, of whose properties very little 
is known to us. 

In addition to an Introduction and the second and third chapters, which 
contain the main bulk of information about the analytic properties of stable 
laws, the book also includes examples of the occurrence of stable distributions 
in applied problems (Chapter 1) and a chapter connected with the problem of 
Statistical estimation of the parameters determining stable laws. However. the 
inclusion of this chapter was dictated not so much by the desire to reflect the 
material of the fourth group to some extent, as by the desire to demonstrate 
the possibility of exploiting the analytic properties of stable distributions in 
solving statistical problems. 
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The structure of the book is such that only the Introduction and the second 
and third chapters are interconnected. The first chapter is not necessary for 
understanding the remaining material, and the fourth chapter makes only 
minimal use of the information in Chapters 2 and 3. 

Although in preparing the material for the main chapters I tried to en- 
compass known results as completely as possible, it is not excluded that some 
facts accidentally escaped my view. 

The same can be said of the first three sections of the references at the end 
of the book. The fourth section, which deals with limit theorems connected 
with stable laws, was put together only for a rough orientation of the reader, 
and thus I did not even try to make it complete. 

Information of a historical nature or concerning priority is reduced to a 
minimum in the main text and is relegated to the section entitled Comments. 

The material in the Introduction and the second and third chapters is 
fundamental to the book. A part of this material contains new results, while 
another part of the collected material was essentially revised in the preparation 
process, both in the formulations of the theorems and in their proofs. 

A characteristic feature of the exposition is that to describe the properties 
of stable laws we use not one but several formally different ways of expressing 
the characteristic functions corresponding to these laws. The fact of the mat- 
ter is that there simply is no single form of expression leading to the simplest 
formulations of the properties presented for stable laws. The use of different 
ways of expressing the characteristic functions enables us to concentrate all 
the formulational complexities not of fundamental importance in the formulas 
for passing from one form to another. 

In this work I used the advice and all-around help of my friends and col- 
leagues, of whom I should mention first and foremost V. V. Senatov and I. S. 
Shiganov. I. A. Ibragimov became familiar with the book and made a number 
of reasonable comments leading to its improvement. I received valuable advice 
from Yu. B. Sindler and L. A. Khalfin on certain parts of Chapter 1, and from 
I. V. Ostrovskii on isolated questions in Chapter 2. The material in Chapter 
4 benefited greatly from a discussion of its content with È. V. Khmaladze 
and D. M. Chibisov. N. Kalinauskaitė helped complete the bibliography. 

Professor J. H. McCulloch of Ohio State University was invaluable in sup- 
plying me with copies of several articles from American publications not easily 
accessible to me. 

G. D. Smychnikov and L. L. Petrov assisted me in the preparation of the 
manuscript. 

Here I express to all these people my sincere gratitude. 


Moscow, 1981 V. M. Zolotarev 
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Introduction 


I.1. Despite its very close kinship with the normal law, the family of stable 
distributions has never enjoyed the great interest of mathematicians in prob- 
ability theory. The laws in this family, which came to light due to the talent 
of Paul Lévy and received their name from him, attracted only moderate at- 
tention from the leading experts, though there were also enthusiasts, of whom 
our own remarkable mathematician Aleksandr Yakovlevich Khintchine should 
be mentioned first of all. The reserve met by stable laws was apparently due 
to the fact that they did not find any applications for a long time. However, 
there was one exception here. In 1919, several years before the appearance 
of Lévy’s monograph introducing the concept of a stable law, the Danish as- 
tronomer Holtsmark published a paper [114] where he found a probabilistic 
principle obeyed by random fluctuations of the gravitational field of stars in 
space under certain natural assumptions about their distribution and masses. 
It turned out that the three-dimensional Fourier transform of the density p(x) 
of this probability distribution law has a very simple form: 


J exp{i(t, xz) }p(x) dz = exp(—Alt|?/?), IER, 
R3 


where is a positive constant determined by the physical characteristics of 
the object under consideration. 

As became clear much later (the article did not at first attract the attention 
of mathematicians), the distribution found by Holtsmark and now bearing 
his name belongs to the collection of spherically symmetric stable laws with 
parameter a = 3/2. 

In our times there has been a sharp increase in the interest in stable laws, 
due to their appearance in certain socio-economic models. 

Returning to the prehistory of the appearance of stable laws, we should 
mention that Lévy’s creation did not arise from nothing. Already a century 
before, Poisson and later Cauchy turned their attention to the distribution 
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with density 


= E À 
p(x) = A 1p (2A ) = Fg?) s0 cer’, 


which has the property that 


Pr, *Pro = PA; (x) 


where A is uniquely determined by A; and Az. Cauchy, who in the context 
of his statistical investigations was interested in finding out when the error 
in an individual observation can be comparable with the average error in a 
series of many independent observations, discovered that such is the situation 
if the errors in the observations have probability distribution with density py. 
The density p) is included (for the value a = 1) in the set {f ta)y of functions 
whose Fourier transforms have the simple form 


exp(—Alt|*), a >0, A>0. (+*) 


Cauchy knew [8] that each of the functions f ie has the property (*), but 
except for f 1) = p) he could not distinguish among them the nonnegative 
functions (each such function is clearly the density of some probability dis- 
tribution). Only in the beginning of our century did the famous Hungarian 
mathematician Polya succeed in proving that f ta) > 0 in the case0 <a < 1 
(11).t He proved that among the functions f (a) there is only one that is the 
density of a distribution with finite variance. And the value corresponding 
to it is @ = 2, i.e., the density of an unbiased normal law is such a function. 
The fact that the functions f (a) are the densities of probability distributions 
only for 0 < a < 2 was established in full by Lévy in his first monograph [50]. 
For a long time, stable distributions with characteristic functions of the form 
(**) were called Cauchy laws. Unfortunately, the tradition later died out. and 
distributions of this type (except for p,) became known simply as symmetric 
distributions. 

The concept of stable distributions took full shape in 1937 with the ap- 
pearance of Lévy’s monograph [51] and Khintchine’s monograph [44]. It then 
became clear that, with certain exceptions, stable laws do not have explicit ex- 
pressions for their densities or distribution functions. The question therefore 
arose of indirectly investigating the analytic properties of stable distributions. 
In past years there have been many investigations dealing with this problem. 
As our knowledge has accumulated about the analytic properties of stable dis- 
tributions (which form a four-parameter family of functions), it has become 


tRaised numbers in this form refer to the corresponding notes in the Comments at the 
end of the book. 
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clear that the complexity involved in investigating various groups of proper- 
ties depends in large part on how fortuitously the parameter system is chosen. 
Expressing analytic relations between stable distributions, and proving them, 
can thus be simpler or more complicated depending on what parametrization 
we employ. This circumstance makes it not only possible but even desirable 
to use several parametrizations on an equal basis, choosing in each specific sit- 
uation the one ensuring the least complexity in statements and proofs. With 
this approach we free ourselves of additional complications that in a reason- 
ably chosen parameter system are put aside and concentrated in the formulas 
for passing from one system of parameters to another. 


I.2. It can be said without exaggerating that contemporary probability 
theory has acquired its independence, its authority as a mathematical disci- 
pline important for applications, and a large part of its arsenal of methods 
in the course of the solution of the problem of approximating distributions of 
sums of independent random variables. The results here, which are grouped 
around the central limit theorem, have come to form in our time a great the- 
ory with many branches. Associated with the creation of this theory, which 
was roughly completed only in the 1930’s and which required in all more than 
a century and a half of effort by several generations of mathematicians, are 
such illustrious names as Laplace, Gauss, Poisson, Tchebycheff, Lyapunov, 
Markov, Lévy, Khintchine, Kolmogorov, and Gnedenko. The theory of limit 
theorems for sums of independent random variables, which has already be- 
come classical in our time, continues to nourish many new areas of probability 
theory, including the theory of Markov chains, stationary processes, martin- 
gales, distributions on groups, and so on. 

In the theory of summation of independent random variables Khintchine 
[44] obtained a fundamental result which underlines the first stage of the 
investigations. It consists in the following. 

Consider a sequence {Xnj,j = 1,2,..., kn}, n = 1,2,..., of series of inde- 
pendent (in the individual series) random variables, and form the sums 


fa Ani? soo + Xnk, — An, (1.1) 


centered by some numbers An. It is assumed that the terms in the sums are 
uniformly infinitesimal, i.e., for each € > 0, 


lim max{P(|Xn;| > €): j =1,..-, kn} =0. (L2) 


Denote by F, and Faj the distribution functions of the respective random 
variables Z,, and X,,;, and by © the set of all distribution functions G that 
can be weak limits of such functions Fn as n — oo. 
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THEOREM A. A distribution function G belongs to È if and only if the 
characteristic function g corresponding to it can be written in the form 


g(t) = exp fita — bi? + J (e*? — 1 —itsinz) dH(2)| (1.3) 
x #0 


where a,b > 0 are real numbers, and the function H(x), which is defined on 
the whole x-axis except for the point x = 0, is nondecreasing on the semi-azis 
x <0 and z > 0, tends to zero as |x| —> ov, and satisfies the condition 


f z? dH(xr) < oo. (1.4) 
0<ļ|z|<1 


The representation (1.3) is called the canonical form for the characteristic 
function g, and the function H is called the spectral function of the infinitely 
divisible distribution G (12). 

The following result of Gnedenko [25] was the second important achieve- 
ment of the theory. Let 


k 
E(Xnj1(|Xnj| < y)), 
J=1 


3 


where I(A) is the indicator function of an event A and y is a fixed positive 
number such that y and —y are points of continuity of H, and let 


kn 
= ar(Ana i XasbSelh? aÀ, 


THEOREM B. Let G be a distribution function in the class ® whose char- 
acteristic function in the canonical form (1.3) is determined by the triple a, b, 
H. In the sums (1.1) choose the centering constants 


An=anly)—a~ f ZOER u`! dH(u). 


Then the distribution functions F, converge weakly to G as n > œ if and 
only if: 
1. at each point x of continuity of H 


k 
z 1 
22 (Fajl) -o $ sgn z) = H (zk (1.5) 
2. 
lim lim supo; = lim liminf o£ = 2b. 


n— 00 e—0 n=œ 
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Proofs of Theorems A and B (possibly formulated differently) can be found 
in any solid course in probability theory; for example, in [22] or in the well- 
known monograph [26], which deals especially with the problematics of limit 
approximations of the distributions of sums of independent random variables. 

The simplest variant of the scheme (I.1) is a sequence of linearly normalized 
sums of independent and identically distributed random variables: 


Zn = (Xi + +++ + Xn) Bz! — An, n=1,2,..., (1.6) 


where B,, > 0 and A, are real constants. 
For condition (I.2) to hold in the scheme (I.6) it suffices that 


Bn >œ asn— Oo. (1.7) 


It turns out that this property holds whenever the distribution functions 
F,,(xz) = P(Zn < x) converge weakly to a nondegenerate distribution G. 

Indeed, suppose that (I.7) fails to hold. Then, clearly, there is a subse- 
quence ng — oo such that Bn, = O(1). The convergence Fp — G implies 
that for each t in any bounded interval 


f(t/Bn, )I"* = lg(t)|(1+ o(1)) as k > o, 


where f and g are the characteristic functions of the distributions F(z) = 
P(X, < £) and G(z). Consequently, for each sufficiently small value of t 


If(t)| = |9(tBn,)[/"* (1 + 0(1)). 


But this relation is possible only if |f(t)| = 1. This means, in turn, that 
\g(t)| = 1, i.e., the distribution G is degenerate, contrary to assumption. 

An analysis of the limit behavior of the distributions F» in the scheme (1.6), 
carried out (under strong restrictions on the distribution of the terms) near 
the end of the 18th century by Laplace and Gauss, led to the emergence of the 
first and, as it later turned out, most important distribution in probability 
theory and its applications—the normal distribution (sometimes also called 
the Gaussian law): 


Palt) = (roy a exp (-zzt - a)?) dt. 


Once the central limit theorem was proved in a very broad treatment a 
hundred years later through the efforts of Lyapunov and Markov (who used 
completely different methods), the experts could switch their attention to the 
analysis of the variety of limit distributions connected with the scheme (1.6) 
and its generalizations. Lévy made the first fundamentally new step in this 
direction. We begin our acquaintance with this result and its subsequent 
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development by a formal definition of stable laws, to the study of whose 
properties we direct all our attention below. 

DEFINITION. A distribution function G is said to be stable if it can occur 
in the scheme (I.6) as a weak limit of the distribution functions F, as n — oo. 

The set of all such functions G is called the family of stable laws and denoted 
by G. 

Since the scheme (I.6) is a special case of the scheme (I.1), the family of 
stable laws is a subset of the set © of infinitely divisible distribution laws. 

There are many different criteria for a distribution function to belong to 
the family G, and, if desired, any one of them can be taken as the original 
definition of stable laws. Some of these criteria will be considered below. 
We begin with the following one, which, by established tradition, is used 
particularly often as a definition. This tradition stems from work of Lévy [51] 
and Khintchine [44]. 


CRITERION 1. A distribution function G belongs to the family © if and 
only uf it has the following property. 

For any positive numbers bı and bz there exist a positive number b and a 
real number a such that 


G(x/b1) * G(x/b2) = G((z — a)/b). (I.8) 


See §I.4 for a proof. 
In Lévy’s first monograph [50] the definition of stable laws used is not (1.8) 
but the more restrictive relation 


G(x/b1) * G(x/b2) = G(x/b), (1.9) 


which corresponds only to a certain subfamily W of G. Some years later, in 
the second monograph [51], Lévy did start out from the definition (1.8), but 
called the laws in the complement 6\20 quasistable laws. Unfortunately, this 
term did not take root in the literature, possibly because an overwhelming 
number of experts did not feel that it was natural to single out the subclass 
W of G in the scheme of the problems they were considering. Here Feller 
was a characteristic exception. Following the terminology in his book [22], we 
shall call the distributions in W strictly stable distribution laws. 

If in the scheme (I.6) we trace the situation leading to limit distributions 
in the class %9, then it turns out to correspond to the case when the linear 
normalization does not need to be centered, i.e., when we can choose the 
constants A, = 0. 
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I.3. Before passing to the proof of Criterion 1 and to a discussion of other 
criteria, we find it convenient first to obtain by purely methodical considera- 
tions a description, analogous to that given above for the class © of infinitely 
divisible distributions, for the family G of stable laws and the subset 2 of it. 


THEOREM C.1. For each stable distribution G the spectral function H 
corresponding to it has the form 


i —C\z~* yr >O, 
Has eum fz <0, 


where C1, Co, and a are nonnegative numbers, and 0 < a < 2. 


(1.10) 


PROOF. We can obviously confine ourselves to the case of a nondegenerate 
distribution G. Denote by ¥ y the set of points of continuity of H. Since G € 
G, there exists a sequence of independent random variables X1, X2,... with 
a common distribution function F and a sequence of pairs An, Bn (Bn — oo) 
of normalizing constants such that F (x) = P(Z, < x) converges weakly to 
G(x) as n — oo. According to the necessary convergence condition (1.5), this 
implies that as n — oo 


n(F(Bnz) — } — ¿ sgn z) > H(z), ze Xy. (1.11) 


Consider the case when z > 0. If H(z) # 0 (otherwise choose Cı = 0) on the 
semi-axis z > 0, then there is a point tọ E€ y for which q = —H (zo) > 0. 
For each t > 0 we now define the positive integer 


n=n(t) = min(k: Bito < t < Bei120). 


The function U(x) = 1 — F(z) is nonincreasing on the positive semi-axis, and 
hence 


U(Bn+1202) U (tz) < U (Bator) 

U(Bnzo) U(t) — U(Bn4+120) 

for any z > 0. Since n(t) — oo as t —> o, it follows from (1.11) that the 

left-hand and right-hand sides of the inequalities in (I.12) tend as t — oo to 
the common limit 


< (1.12) 


L(x) = —H(zox)/q 
if roz € XH. Consequently, on the one hand 
U(txy)/U(t) > L(zy) 
as t — oo for any positive x and y such that zoz, roy E XH. On the other 
hand, we can compute the same limit differently: 
U(tzy) _ U(tzy) U(ty) 


UG) ~U oe | POE). 
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In summary, we get the equation 
L(y) = L(x)L(y), (1.13) 


valid for all positive z and y except possibly for the points in some countable 
set. 

The function L(x) is defined at all points of the positive semi-axis, where it 
is also nonnegative and nonincreasing with increasing z, and it has the values 
L(1) = 1 and L(oo) = 0. Under these conditions a solution of equation (1.13) 
necessarily has the form L(x) = x~%, where a is a positive constant (see the 
monograph [30] for details on solution of the equation of the form (1.13) or of 
equations reducing to this equation). 

The function H(z) must therefore have the following form for xz > 0: 


H(z) = —qL (2/29) = 25 H (tos | = —Ci 2. 


The additional condition (I.4), which is imposed on any spectral function of an 
infinitely divisible (hence also of a stable) distribution, constrains the values 
of the parameter a to the interval 0 < a < 2. The constant C, > 0 is not 
restricted by any additional conditions. It is not hard to see this if together 
with the sequence of normalized sums Z, we consider another sequence of 
normalized sums of the form CZ,,, C > 0. Both sequences have limiting stable 
distributions, with respective spectral functions equal to H(x) = —C,r~* and 
Holt) = —C,C%zr-*. 
The case x < 0 is analyzed similarly and leads to the conclusion that 
H(z) = Co(—2z)~°, where Cz > 0 and 0 < 6 < 2. Let us show that a = 6. 
In the course of analyzing the case z > 0 it was established that 
U(tz) 1- F(tz) 
U(t) 1-F(t) 
This means (see [22]) that 1 — F(z) = x~%h,(z), where hı(z) is a slowly 
varying function. Consequently, by (I.11), 


nil — F(B,2)) = nB “hi (B,2)2~* > Ci2~° 


as n — oo for any z > 0. From this, using properties of slowly varying 
functions, we find that 


=> “ast oS. 


A E 


nB; *hı(Bn) ~ C1. (1.14) 
The representation F(x) = (—x)~fha(—x) for z < 0 is obtained in a com- 
pletely analogous way, and so 

na ha] ~ Co. (1.15) 


If we assume that the spectral function H(z) vanishes on neither semi-axis 
z >Q nor z < 0, i.e., that Cı > 0 and Cz > 0, then the relations (1.14) and 
(1.15) are possible only if a = 6. 
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REMARK. We note that the last part of the arguments in the proof allow us 
to determine the behavior of the normalizing coefficients Bn as n — oo in the 
case when F, in the scheme (I.6) converges to a nondegenerate distribution 
G with nonzero spectral function H (i.e., Cy + C2 > 0). Indeed, let Ci > 0. 
Then, according to (1.14), n ~ C1ıB2h7 (Bn), where 0 < a < 2. The basic 
order of variation of the right-hand side is obviously connected with Bf. 
Therefore, the basic order of variation of B, is a power, and the quantity 
h;(B,), as a function of n, varies more slowly than any power: it is a slowly 
varying function h,(n). Consequently, as n — oo 

B, ~ n/*(Cr"h, (n))/% = n hin), (1.16) 
where h(n) is a slowly varying function. In the theory of regularly varying 
functions the relation (I.16) would be a simple consequence of the fact that 
the function inverse to a regularly varying (but not slowly varying) function 
is again a regularly varying function. 

A description of the family G, to which we now turn, is obviously also a 
criterion for membership in G. 


THEOREM C.2. A nondegenerate distribution G belongs to the family S 
if and only if the logarithm of its characteristic function g can be represented 
in the form 

log g(t) = A(ity — |t|* + ttwa(t, a, 8)), (A) 

where the real parameters vary within the limits 0 < a < 2, -1 < p= I, 
—00 < Ņ < œ, À > 0, and 

tl®-!8tan(ra/2) ifaF#1, 

i | ' Bim) k i | A H 


REMARK. The index A is used to distinguish the given form of expression 
for characteristic functions of stable laws from the other forms of expression 
we deal with below. 

PROOF. Since G is a subset of 6, to make more precise the form of g we 
substitute the expression (I.10) for its spectral function H into the general 
form (I.3) and compute the resulting integrals. For 0 < a < 2 we have that 


log g(t) = ita — bt? — Cy Í ("7 — 1 —ttsinz) dz * 
0 


[© @) 
— ca | (e~** — 1 + itsin z) dx ° 
0 


= ita — bt? + CiQa + C2Q,. (1.17) 


Consider the function 
(0 @) 


Pals, p) = p | (*-e Mja "de, O<a<?2, 
0 
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in the domain Res > 0, Rep > 0. It is not hard to see that pa(s,p) is 
continuous with respect to œ in the interval (0,2) for fixed s and p and is 
continuous on the boundary of the domain (i.e., on the straight lines Res = 0 
and Rep = 0) for a fixed value of a. A comparison of the integral Qa with the 
function ©, shows that if we set s = i and single out the real part of pali, p), 
assuming here that p is positive, and then pass to complex p and replace p by 
—it, then we get the integral Qa, 1.e., 


an = Re Pa (2, p)| p=—it- 


It is not complicated to compute an explicit expression for pals, p). Assuming 
first that a Æ 1, we integrate by parts and find that 


p CO 
Pals p) = J (se** pe "r “de 
0 


l-a 
r(2-qa) 


a Ara 
l-a ts 


aula! 


We then pass to the limit as a — 1 and get the equality 


pı(s, p) = plog(p/s). 

This procedure leads us to the following expression for Qa: 

— |t|°T (1 — a) cos(ra/2) 

OL= — it(1 — |t|} (1 —a)sin(ra/2) ifa #1, 

— |t|r/2 — ct log |t| fas kL 

Let 
La 

(in the case œ = 1 the value of d is understood in the sense of the limit as 
a — 1); then 


d=(C, + C2) (2 


lg(t)| = exp(—bt* — djt|*). 


We show that the numbers b and d cannot simultaneously be nonzero. Since 
G € G, there exist a characteristic function f(t) and sequences of normalizing 
constants A, and Bn (— oo) such that 


rt 
Consequently, |f(tB;1)|" — |g(t)|. Assume that d > 0. In this case B, = 


n\/h(n) according to (I.16). Therefore, By, /Bnx > k~1/* as n > o for any 
integer k > 1. This fact then gives us that 


tB je = g(t) asn— oo. 


nk 
= (gka). 


—1l\jnk _ Bn _p-l 
FERDI = |i (ee By?) 
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On the other hand, this same limit must obviously be equal to |g(t)|. Thus, 
ip bek are — dlt|*) = exp(—bt? — de), 


for any integer k > 1. But this is possible only if b = 0. 


Define (14) 
oe d ifC,+C2> 0), 
oe a ire ants if Cy + C2 > 0, 
0 if Cy + Co =0, 
q= (a+a)/d, 
where 


= (C2 — C,)I'(1— a)sin(ra/2) ifa #1, 
0 ilas 

The right-hand side of (1.17) can now be written in the form (A) we need. 

The canonical representation (A) of the characteristic functions of stable 
laws has one disagreeable feature. The functions 9, and hence also the distri- 
butions G associated with them, are not continuous functions of the param- 
eters determining them, since, as is clear from the definition of the function 
wa(t,a, 3), they have discontinuities at all points of the form a = 1, 6 # 0. 
Taking the limits a* > 1 (a* #1), B* ~ 8 #0, à > A, and 7* — 7¥ not 
only does not yield the stable law with the parameters a = 1, 6, J, and À, 
but does not even yield a proper distribution in the limit. The whole measure 
goes to infinity. Moreover, we ourselves created this situation by concealing 
the compensating shift a in the parameter 74. In the original expression for 
the functions g the integrals Qa, as was especially emphasized, are contin- 
uous functions of the variable a. Thus, we can remove the discontinuity by 
selecting the shift aA~! = —8 tan(ra/2). The modification of formula (A) in 
this case is as follows: 

log g(t) = A(ity — |t|* + itwm(t, a, 8), (M) 
where (cis l 
E E | (\t}*-1 — 1)8 tan(ra/2) if a f 1 
— B(2/7) log |t| igst 

It is easy to see that the function wm is jointly continuous in its variables. 
The domain of variation of the parameters in the form (M ) is the same as in 
the form (A). The parameters of the forms (A) and (M) are connected by the 
following relations (the subscripts indicate that the parameters are associated 
with the corresponding form): 


aa=am, Ba=bm, Ya= Im —Btan(wa/2), Aa = Am: 
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There are many different forms of expression for the characteristic function 
of stable laws. One of them (whose use can be justified by considerations of 
an analytic nature) is the following. 


THEOREM C.3. The characteristic function g of nondegenerate distribu- 
tions G in © can be written in the form 


log g(t) = A(zty — |t|°wa(t, a, 8)), (B) 
“uu (—i(n/2)3K (a) sent) ifa#1 
_ f exp(—a(a a)sgnt) wfaFl, 
i a Par ie ite val, 


K(a) = a — 1 + sgn(1 — a), and the parameters have the same domain of 
variation as in the form (A). 


The connection between the parameters (which is not hard to establish by 
equating the right-hand sides of (A) and (B)) gives us the following relations 
(the main parameter a is the same in both forms): if a = 1, then 


Ba=6p, ŅJa=2yB/T, AA =TAB/2?2, (1.18) 
and if a Æ 1, then 
Ba = cot(ma/2) - tan(mBpK(a)/2), 
ya = 7B(cos(7BpK (a)/2)~*, (1.19) 
AA = Apcos(tGpK(a)/2). 


In the form (B), as in (A), stable laws are not continuous at points of the 
form a = 1. However, contrary to (A), the limit distribution as a* — 1, 
B* — B,y* — J, and A* — à exists if a* always remains greater than or 
equal to 1. Moreover, the limit is a stable law with characteristic function g 
of the form 


log g(t) = A(at(y + sin(m8/2)) — |t| cos(78/2)), 
where “+” is used if a* > 1, and “—” is used if a* < 1. 


I.4. In what follows, the characteristic functions g will be accompanied 
by parameter values, i.e., g(t) = g(t,a,{,7,A). In a special case we use the 
briefer notation g(t,a,3) = g(t,a, 8,0,1). If it must be underscored that a 
particular form ((A), (B), etc.) is being used to express the functions, then g 
(and the other quantities and functions connected with it) will be given the 
corresponding subscript. For example, using this convention, we write the 
following easily verified property: 


Iga(t, a, 8, y, A)| = exp(—Alt|®). (1.20) 
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It follows from this property of ga that the corresponding distribution G 
has a density g4 which exists and is uniformly bounded on the whole axis, as 
is any derivative of it. Indeed, by using the inversion formula 


1 
ga(r) = oe | eoad 
and property ` 20) it is easy to see that 


POLS g f laatade = AH y-oa 


The distribution Fe and density of the elie law with characteris- 
tic function g(t, a, 3,7, à) are denoted by G(z, a, 3,y, A) and g(z, a, 3,7, A), 
respectively, and, in shortened variants, 

G(,a, 8) = G(x, a, 8,0, 1), 
g(x, a, B) = g(x, a, 8,0, 1). 

Everywhere in what follows we use the respective notation Y (a, 3,7, 4) 
or Y(a,3) for random variables with stable distributions G(z,a, 3,7, A) or 
G(z, a, 3), and in general the symbol Y is reserved solely for denoting random 
variables with distributions in ©. 

We should stipulate one more rule which we follow throughout the book. 

In all equalities connecting functions of random variables with the meaning 
that they have the same distribution (the symbol 4 will be used for such 
equalities) the random variables on one side of an equality (even when written 
the same) are understood as being independent. 

For example, with the use of this rule, Criterion 1 can be formulated as 
follows. 


d 
A distribution of random variables X 4 £ Xz Æ const belongs to © if and 
only if for any positive numbers bı and bz there exist a positive number b and 
a real number a such that 


6X1 + boXo = bX +a. (1.21) 


Here no special mention is made of the fact that the X; and X2 on the 
left-hand side of (I.21) are independent. 

This way of expressing relations between distributions turns out fairly often 
to be preferable to an expression using the distributions themselves, the latter 
being more cumbersome and less intuitive. There will be more than one 
opportunity below to see the truth of this statement. 

Let us turn to the proof of Criterion 1 in the form (1.21). Induction argu- 
ments, beginning with the relation X; + X2 £ by Xı + ag, enable us to verify 


that 
Rite hp, mod 
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Consequently, for any n > 2 
Xi++::+Xn üa ay. 
by, By 
Obviously, the limit distribution of the left-hand side as n — oo exists and 
coincides with the distribution of X1, which must belong to ©. This proves 
that the criterion is sufficient. We show that it is necessary. 
Let X, 2 Xo £ Y (a, ß,Ņ, À), and choose positive numbers bı and 62. In 
terms of the characteristic functions g of the random variables Y, (I.21) is 
equivalent to the equality 


g(bıt)g(bzt) = g(bt)e*®. (1.22) 
By using the form (A) it is easy to verify this equality for the choices b = 
(bF + b9) and 


_ f Albi + b2 — b) fedi 
C | NB(2/m)(b log(b, /b) + b2 log(b2/b)) ifa=1. (1.23) 


CRITERION 2. The distribution of the random variables X: g Xə 2 


d 
Xn Se # const belongs to © if and only if for any n > 2 there exist a 
positive number b, and a real number an such that 


Ky tet et RES hte (1.24) 


In outward appearance this criterion seems to be a weakening of the condi- 
tion (1.21). It can be proved in the same way as Criterion 1. In turn, Criterion 
2 can be weakened in outward appearance (1°). 


d 
CRITERION 3. The distribution of the random variables Xı $ X2 2 X3F 
const belongs to © if and only if the relation (1.24) holds for n = 2 and n = 3. 


The proof of necessity is analogous to that for Criterion 1 and does not 
present difficulties. 
We prove sufficiency, i.e., we prove that if 


Re Sot ba ete, Sp Ne Ne ete + T 


where bı and bz are some real numbers, then the X; have a stable distribution. 
Denote by f(t) the characteristic function of the random variable X1. Our goal 
is to prove that f(t) satisfies for each integer n > 2 the equation 


f” (t) = f(bnt) exp(zant), (eR, 


which is equivalent to the condition (1.24). 
The validity of this equation for n = 2 obviously also implies its validity 
for any of the values n = 2” with constants b, and an of the form b, = b%”, 
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an = a2(1+b2 +--+ i T From this it now follows (see, for example, [54]) 
that the distribution of X, is infinitely divisible, and hence f(t) 4 0 for any t. 

Define the function y(t) = log s(t), t € R!. The condition of the criterion 
can be written in terms of x as follows: 


2x(t) = x(bat) + tagt,  3x(t) = x (dst) + tazt. 
Consequently, for any 7,k = 0,+1,+2,... 
293k y(t) = x(b4b§t) + taje, 


where the aj, are real numbers. 

The set {273*: j,k = 0,+1,+2,...} of numbers is dense in the set of all 
positive numbers. This follows from the well-known fact that the set of num- 
bers of the form 7 + wk, 7,k = 0,+1,+2,..., where w is irrational, is dense in 
the real line. Thus, for any n there exists a sequence of numbers rm = 21m 35r 
such that rm — n as m — oo. We let b,(m) = bir bkm and show that this 
sequence of numbers is bounded. From the functional equation for x(t), 


Tm Re x(t) = Rex(bn(m)t) 


for any integer m and real t. If the sequence b,,(m) were unbounded, then it 
would have a subsequence b,(m’) such that |b,(m’)| — œ as m’ — oo. Let 
us perform the change of variable t’ = tb,(m’) in the last equation. Then, 
considering that rm’ — n as m’ — oo, we have that 


Re x(t’) = rm Rex(t'/bn(m’)) > 0. 


Consequently, |f(t)| = 1. But this contradicts the condition that the distribu- 
tion of X; be nondegenerate. 

The sequence 6,,(m) is bounded, so it has a subsequence b,,(m’) converging 
to some number bn as m’ — oo. Hence, 


Oj 1 Kegs = tt [Xba (m JE) — rm x(E)] 
— it~ "[x(bnt) — nx(t)] = an 
as m’ — oo. Consequently, for any n and any real t 
nx(t) = x(bnt) + itan. 


But this equation is equivalent to (1.24), i.e., we have reduced the sufficiency 
of Criterion 3 to that of Criterion 2. 


I.5. Let us proceed to a description of the class W of strictly stable laws. 
The definition (1.9) can be rephrased as follows in terms of random variables. 
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The distribution of the random variables X; £ X2 4 const belongs to W 

if and only if for any positive numbers bı and b2 there is a positive number b 
such that 

bi X1 +b2X2 = bX}. (1.25) 


We begin with the necessary conditions. Let X, £ Xə g Yia, pA) In 
this case (1.25) is equivalent to the equality 


g(bıt)g(b2t) = g(bt), 


which is a particular case of (1.22). It is clear from the expression for a in 
(1.23) that this quantity is zero only if 


y=0 Hozi) o ġ=0 (a=) (1.26) 


Although this conclusion relates to the form (A), it is the same (as is clear 
from the transition formulas (1.18) and (1.19)) in the case when we use the 
form (B). 

The following is a criterion for a nondegenerate distribution to belong to 
the class W. 


CRITERION 4. Let cj = exp(—d;), J =1,...,k, k > 2, be numbers such 
that 
a) c? + +--+? <1, and 


b) among the numbers d,,...,d, there are two with an irrational ratio. 
d 
The distribution of the random variables Xı £ X92 Son Xk Æ const 


belongs to W if and only if 
C1 Xi +C2X2 + +++ +cnXp oe (1.27) 


The necessity in the criterion can be verified without difficulty in the same 
way as for the preceding criterion (by means of characteristic functions, with 
membership in %W taken into account in the form of the condition (1.26)). 


Furthermore, in the course of the proof the following facts become obvious. 


a) The parameter a of the random variables X, = Xə D Nr = 


Y (a, 8,7, A), satisfying the condition (1.27) is determined as the unique pos- 
itive solution of the equation cj + ---+ cj = 1. 

b) The parameters 3, y, and à (from the set of those not bound by the 
condition (1.26)) can vary within the limits of their domains of variation. 

The sufficiency of Criterion 4 is proved by fairly complicated analytic argu- 
ments, and we shall not present the proof here. Those wishing to go through 
it will find it in the monograph [38], where Theorem 5.4.2 contains the suffi- 
ciency assertion in Criterion 4 as a particular case (18). 
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The class W is formally described by four parameters, although it is actu- 
ally a three-parameter set of distributions (because of (1.26)). This is clearly 
inconvenient from an analytic point of view. We can pass to a system of 
three parameters by changing the original system (a, 3,7, A), and it is most 
convenient to connect the transition with the form (B). 


THEOREM C.4. The characteristic functions g of the distributions in W 
have the representation 


log g(t) = —Alt|* exp(—t(7/2)0a sgn t), (C) 
where the parameters a, 0, and X vary within the limits 
O<a<2, |0|<6,=min(1,2/a-—1), A>0. 


The case a = 1 occupies a special position in the parametrization system, 
because, in particular, the values of the parameters a = 1,6, and à with |0| = 1 
correspond to a degenerate distribution at the point 0. If we exclude this 
case, then the parameters a, 3, y, and à of the same characteristic function 
g in the form (B) are connected with the parameters ac, 0, and Ac by the 
equalities 

QC = QB, 
b = Bg K(a)/a, ÀC = ÀB if a Æ L (1.28) 
0 = 2 arctan(27B/T), àc =AB(n?/44+ 7%)? ifa=1. 


Together with the parametrization system a, 0, Ac we can use another 
parametrization system a, p, Ac close to it for the laws in W, where p = 
(1+ 6)/2. 

It is reasonable not to single this out as an independent form, and to call 
it also the form (C), since it is after all only a modification of (C). We need 
not worry about confusion if we use the notation a, 0, À for the parameters in 
the case of the form (C), and the notation a, p, for the parameters in the 
case of the modification. In the modified variant the characteristic function g 
has the form 

g(t) = exp{—A(zt)* exp(—z7apsgnt)}. 

We complete the description of the various forms of expression for the 
characteristic functions of stable laws by one more modification of the form 
(C) (it is more significant than the preceding one) which is needed in solving 
the problem of statistical estimation of the parameters of these laws. The 
logarithm of the characteristic function g of each distribution in I can be 
written in the form 


log g(t) = — exp{v~1/? (log |t| + 7 — i(t /2)0 sgn t) + Ce S-i (8 
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where C = 0.577... is the Euler constant, and the parameters vary within the 
limits v > 1/4, |0| < min(1,2,/v — 1), |r| < co and are connected with the 
parameters a, 6, and À by the equalities 


yv=a-’, Üp = 9c, r = (1/a) log Ac + C(1/a — 1). (1.29) 


Of course, one may puzzle over the question: why such an abundance of 
different forms for expressing the characteristic functions of stable laws? In 
studying the analytic properties of the distributions of stable laws we en- 
counter groups of properties with their own diverse features. The expression 
of analytic relations connected with stable distributions can be simpler or 
more complicated depending on how felicitous the choice of the parameters 
determining the distributions for our problem turns out to be. By associating 
with a particular group of properties the parametrization form most natu- 
ral for it we thereby minimize the complexity involved in expressing these 
properties. In this approach the extraneous complexity is, as it were, isolated 
from the problem and relegated to the formulas for passing from one form of 
expressing the characteristic functions g to another (13). 

We make one more important remark concerning the forms given above for 
expressing the functions g. In the form (A) the structure of the parameter 
Ga, te, C*) 

Ba = (C1 — C2)/ (C1 + C2) 


enables us to connect the extreme values 84 = 1 and G4 = —1 with two 
distinctive cases in the scheme (I.6). Indeed, suppose that the sums Z, in 
this scheme are formed from positive random variables X1, X2,... such that 


the limit distribution G is not normal. Then the left-hand side of the limit 
expression (1.11) of the spectral function H corresponding to G is identically 
zero for x < 0. Therefore, H is also equal to zero on the semi-axis x < 0, i.e., 
Cz = 0. Consequently, G has the parameter 04 = 1. 

In exactly the same way we find that the limit distribution G for the nor- 
malized sums Z,, has the parameter 34 = —1 in the case of negative terms 
X;. This connection between the properties of the terms X; and the extreme 
values of the parameter 3,4, though not reversible (i.e., for example, the X; 
are not necessarily positive when 3,4 = 1 for the limit law), nonetheless serves 
as a simple reference point in describing the properties of stable distributions. 
This suggests that it is desirable to have the same rule in the other forms of 
expression for the characteristic functions. 

It holds in the forms (A), (M), and (B), i.e., G4 = 1 corresponds to 8m = 1 
and g = 1. In the forms (C) and (E), however, 34 = 1 corresponds to the 
value 0 = a if a < 1 and to the value 0 = —@, if a > 1. Of course, it is 
easy to redefine the expressions (C) and (E) so that 6 = ĝa also in the case 
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a > 1. But here we must choose between two incompatible properties of the 
form (B): 

1) Ca = LHee=é,. 

2) The distributions in %W are jointly continuous with respect to the defining 
parameters in the whole domain of their admissible values. 

And in this situation the preference should undoubtedly be given to the 
second property (17). In the light of this it is worth turning our attention to 
the form (M), which has both the indicated properties. 


I.6. The definition given above for the family G of one-dimensional sta- 
ble laws can naturally be extended to the case of finite-dimensional and even 
infinite-dimensional spaces. We consider a sequence of independent and identi- 
cally distributed random variables X1, X2,... with values in the k-dimensional 
Euclidean space R* and form the sequence of sums 

Xn = Onl Xi + «+: + Xn) — An, ie Oe eee 

normalized by some sequences of positive numbers on and nonrandom ele- 
ments a, of R*. The set G* of all weak limits of the distributions of such 
sequences X, as n — œ is called the family of stable distributions on R* or 
the family of Lévy-Feldheim distributions. This is not the only way of gener- 
alizing the distributions in G. If the sums X; + --- + Xn are normalized by 
nonsingular matrices on and not by positive numbers, then the concept of sta- 
ble laws becomes essentially broader. Very little is known at present about the 
properties of multidimensional stable laws (in particular, about their analytic 
properties). Neither the amount nor the diversity of the facts known here can 
compare in any way with what is known about the distributions in G. For 
this reason a summation of facts on the properties of multidimensional stable 
laws would certainly be a premature undertaking at this time. We confine 
ourselves here to presenting and commenting on a canonical representation of 
the characteristic function g(t), t € RF, of finite-dimensional Lévy-Feldheim 
laws. 

It turns out that the distributions in G* now form a nonparametric set. 
The corresponding characteristic functions have the form 


g(t) =exp{z(t,a)—Yalt)}, W<a<2, (1.30) 
where a € RF and the functions Ya(t), which are determined by the parameter 
œ and by a certain finite measure M (d£) on the sphere S = {€: |€| = 1}, are 


as follows: 
If a = 2, then a(t) = (ot, t), where ø is the so-called covariance matrix. 


If 0 < a < 2, then 
bt = / (t, €)|®wa(t, €)M (dé), (1.31) 
S 
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where . 
1 — i tan(ra/2)sgn(t, £) fog 1, 


walt, €) = 1 + 2(2/7) log |(t, €)|sgn(t,€) if a =1. 

The representation (I.30)-(I.31) is an analogue of formula (A) for express- 
ing characteristic functions of one-dimensional stable laws. 

This analogue is not the only one. If we use a spherical system of coordi- 
nates in RF and write a vector t in the form t = rn, where r = |t| # 0 and 
7 = t/|t|, then it is not difficult to give the representation (1.30)—(I.31) the 
following form: 

liry —r%(1—iBtan(ma/2))| ifaF-~l, 

log g(rn) = ae —r(1+i(2/m)Glogr)] ifa=1, 

where 0 < a < 2 and 2, y, and à are real functions defined on the unit sphere 
S and determined by the equalities 


ka= [ (m \l@M(dé), nEs, 


(1.32) 


TET AR [ in £)” sen(n, EM (dé), 


(n,a) fa #1, 


We note some properties of the functions 8, y, and 4. 

1. They are continuous on S, and for a given value of a they uniquely 
determine the shift a and the measure M(dẸ) in the representation (1.30)- 
(1.31). In particular, for a given value a # 1 the functions 8 and À uniquely 
determine the measure M. This implies that the integral transform 


A(n) = [ (n, EI E) > 0)M(dé), nEs, 


of M on S (which is a function on S) uniquely determines this measure. 
2. The domain of variation for the values of the function y is the whole real 
axis. 
3. The following relations hold for any ņ € S: 
B(-n)=—-B(n), — A(=n) = A(n), 
BmI <1, 0 < A(n) < Mo, 


where Mo is the value of the complete measure M (d€) on S. Here all the 
inequalities are strict unless M (d£) is concentrated entirely on some subspace 
of RF. This leads, in particular, to the conclusion that 


ào = inf{A(n): n E S} >0,  |g(t)| < exp(—àolt|*), 
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and hence the corresponding stable distribution has density p(xr,a,a,M) 
bounded by the quantity 


(1+ k/a) 
(14+ k/2) 


Each of the forms (M) and (B) also has two analogues obtained by trans- 
forming the corresponding analogues of (A). Namely, if a 4 1, then 


log g(t) = a(t, a) — Jawa — i(t, €) tan(1/2)a(|(t, €)|*~* — 1)}M (d£), 
log g(rn) = Altry — r% + 7Br(r*—* — 1) tan(ma/2)}. (1.33) 


(2m0 °)" 


In the case a = 1 the functions log g(t) and log g(r7) are defined just as 
n (I.30)-(I.32). As a result, these representations turn out to be continuous 
functions of a in the whole domain 0 < a < 2 in which a varies. 

The first analogue of the form (B) is the representation (I.30)—(1.31) with 
Walt, E) replaced by the function 


" _ f exp(-15 A (a)sgn(t,€)) ifa#l, 
Walt, £) = | 5 + ilog llt Eagal, E ias 


and M (d£) replaced by the measure 


~ Jola Mdi) ifa 1, 
DEAL | (2/r)M (dE) ifa=l1. 


The second analogue of (B) is obtained from (1.32) by the same trans- 
formations used in the one-dimensional case for getting (B) from (A). This 
representation has the form 


_ f Aliry — r° exp(—i(t/2)K(a)b)] ifa #1, 
log a(n) = Airy — r((1/2) + ib log r)| tasi. ae) 


The elements a, 3, y and X in (1.34) determining the stable distributions 
are connected with the set of analogous determining elements in (1.32) by 
relations (1.18) and (1.19). 

In those cases when the measure M (d£) happens to be concentrated on 
a half S* of the sphere S, the first analogue of (B) allows us to write the 
Laplace transform of the corresponding stable distribution p(x, a, a, M ) bya 
componentwise analytic extension of g (the substitution t = is). Namely, 


Sea 
Rk 


afeti ela) fo. 6)°MUd0} ar 
exp{—(s, Sree E)log(s,€)M(dé)} ifa=1, 
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where e(a) = sgn(1 — a), and the vector s takes values in the half-space L* 
containing S*. 

The fact that the Laplace transform of the density p exists for the val- 
ues s € L* indicates, in particular, that the function p(z, a, a, M) decreases 
exponentially in the case when z ¢ L* and |z| — ov. 

This observation can be refined. Namely, let T C S* be the support of the 
measure M. The set T generates in R* a corresponding cone C contained in 
the half-space corresponding to the hemisphere S*. Denote by C the convex 
hull of C. 

In this situation it is relatively simple to see the following properties of the 
density function p: 

a) If0<a<1, then for all zr ¢ C 


plz +a,a,a, M) = p(z,a,0,M) = 0, 


and p(x, œ,0, M) > 0 for all interior points z of C. 

b) If 1 < a < 2, then p(z,a,a,M) > 0 for all points z € R*; however, the 
function p(x + a,a,a, M) = pe, ov, 0, M) decreases exponentially in the case 
when z ¢ C and |z| — œ. 

The representations given for the characteristic functions of stable distri- 
butions can serve as a starting point for various analytic investigations of 
properties of one-dimensional stable laws. At present the forms (1I.30)—(1.31) 
are those used most in the literature, apparently due to the manifestation of 
a peculiar “inertia” created by the first investigations. 

The analogues of the forms (B) and (M) have not yet been used in gen- 
eral, although they unquestionably have some merits that may in time make 
them no less popular. For example, these representations make it possible 
to connect the density p(z,a,a,M) with the densities g(u,a, 3,7, A) of the 
one-dimensional stable laws. This connection has the simplest form in the 
case when the dimension of the space is odd: k = 2m + 1. Namely, for all 
ze R* 

Sao) 


at ee (2m) 
2(27)2™ J.s ((7, x), a, p, y, A) dr, 


where (3, y, and à are the functions in the representation of the characteristic 
function of the distribution p(z, a,a,M), and where the density g(u, a, 3, +y, A) 
corresponds to the form (A) or (B), depending on which of the representations 
(1.32) or (1.34) we use. 

The functions 8, y, and A (independently of what representation they 
are associated with) are, respectively, the generalized asymmetry, shift, and 
scale characteristics of the distribution, as in their interpretation in the one- 
dimensional case. However, it should be borne in mind here that in carrying 


p(z,a,a,M) = 
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various concepts associated with one-dimensional distributions over to the 
multidimensional case we inevitably encounter a variety of possible general- 
izations. For instance, the analogue of the symmetric laws in G, which have 
characteristic functions of the form 


g(t) = exp(—A|t|*), 


can be taken to be the spherically symmetric distributions on R* having the 
same form of characteristic functions (with A = const), but the analogues of 
the symmetric laws can also be taken (and this is more natural) to be the 
stable distributions with the functions 3(n) = y(n) = 0 for all n € S, which 
is equivalent to the equality 


log g(t) = —A(n)r* = - J I(t, €)| M (d£), (1.35) 


where M(d€) is a centrally symmetric measure. 

It is still comparatively rare to encounter multidimensional stable laws in 
applications (we saw some examples of their occurrence in §1.1). However, 
there is reason to expect that this situation will change in the near future 
(due first and foremost to applications in economics). 


CHAPTER 1 


Examples of the Occurrence 
of Stable Laws in Applications(!°)* 


81.0. Introduction 


The main and most prominent user of stable laws thus far has been proba- 
bility theory itself. Stable distributions can be encountered in the most diverse 
areas in the contemporary mathematical literature. First and foremost, this 
involves limit theorems for sums of independent and weakly dependent ran- 
dom variables, and a variety of refinements of such theorems. Further, we can 
name the theory of random processes, in particular, the theory of branching 
processes, the theory of random determinants, and a number of other areas 
where problems connected with stable laws are encountered to some extent. 
The two volumes of Feller [22] can provide a first and fairly thorough acquain- 
tance with such problems, and have justly received worldwide recognition. 

The pertinent material here is so extensive that we have not attempted 
a comprehensive compilation. We have nevertheless regarded it as useful to 
reflect a part of this material, connected mainly with limit theorems, in the 
fourth section of the Bibliography. 

As illustrations we present three examples of the occurrence of stable laws 
in mathematical problems. 

EXAMPLE 1. The following problem is a classic in probability theory and 
is given in many publications (for example, in the first volume of Feller cited 
above). 

We consider a sequence of tosses of a symmetric coin and compute the 
difference A,, between the number of heads and tails in n tosses. 

This procedure can be represented schematically as a random walk along 
the integer points of the real axis under the assumptions that we are initially 
at the point z = 0 and that in any position a shift by 1 takes place to the 


* Raised numbers in this form refer to the corresponding notes in the Comments at the 
end of the book. 
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right or to the left with probability 3 independently of the position and 
independently of how this position was reached. 

The probability that a point r > 0 is first reached after n steps is denoted 
by prn- Of course, p;,n = 0 if the numbers r and n have different parity. If 
they have the same parity, then 


Prin = 5 " heni ait 


The probability g,(N) that the point r is first reached in no more than N 
steps is obtained by summing the probabilities p,,»: 


ar(N) = S (Prin: Lan < NN). 


Let N = 2(r?t], where [-] denotes the integer part, and the number t > 0 is 
fixed. Then a simple asymptotic analysis shows that as r — oo 


Ny) vir fe exp(—2?/2) dz = G(t, 3,1), 


i.e., the limit distribution of the - hit of the point r within 2{tr?] steps 
coincides with the Lévy distribution (see §2.2). i 

EXAMPLE 2. We consider a random branching process with discrete time 
and with a single type of particle. Each particle is transformed in one of three 
possible ways at the end of a unit of time: it disappears with probability 
p > 0, is transformed into the same particle with probability 1 — 2p, or is 
transformed into two particles with probability p. The generating function 
F(s) of the transformation probabilities for a single particle (this function 
determines the random process) has the form 

F(s) = p+ (1 —2p)s + ps”. 

Denote by v(t) the number of particles in the (t—1)st generation that do not 
have descendants (i.e., that disappear at the appearance of the tth generation), 
and by v = 5-2, vo(k) the total number of such particles that appear in the 
process during the entire time 0 < t < oo of its evolution (vo(0) = 0, because 
precisely one particle was considered at the beginning of the process). The 
generating function y(u) = } o gnu” of the distribution qn = P{v = n} is 
connected with the function F(s) by the equation y(u) = F(y(u)) + p(u—1), 
which easily yields the following explicit expression for (u): 


plu) = 1-VI=a= Dg DETAN 


n! 


This implies that 
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This asymptotic expression indicates that the distribution of the random vari- 
able v belongs to the domain of normal attraction of the stable law with 
parameters (in the form (B)) a = 1/2,@ = 1,7 =0, and A = 1. 

Thus, if at the present time there are not one but n particles, then the 
total number of particles without descendants is expressed by a sum vı + 
--+ + Vn of independent random variables with the same distributions as v. 
After normalization, the sum (vı +-+- + ,)n~? has a distribution that is 
approximated by the stable distribution G(z, T 1) as n increases. 

It is worth remarking that examples of stable laws with various values of 
the main parameter a can be found among this kind of problems involving 
branching processes. For instance, [74] mentions situations in which stable 
laws with a = 2~",r = 1,2,..., appear as limit laws. 

EXAMPLE 3. This example is taken from the theory of random matrices 
(concerning this see [24]). We consider systems of linear algebraic equations 
of the form 

Haia = Was ET e re (1.0.1) 
where En = i )) is an n x n matrix with random elements xe ) and 
Wg = (w) is a random vector. As we know, there is a unique solution of 
(1.0.1) if det =, Æ 0, and 

i, = (x\”) =B Was 


In the case when det £,, = 0, a solution of (1.0.1) can fail to exist, and in this 
situation we agree to assign Xn the value x, = 0. For large values of n, when 
solving linear equations becomes a very laborious computational problem, the 
following limit approximation can supply a certain amount of information. 
Assume that for each value of n the random variables x ) and wir), i,j = 


1,...,n are mutually independent, EX: ) = Ew.” = 0, Var K ) = Var wir 
= 1, and the quantities sup, ; ; SP aide + wi” |5) are finite. Then the 
following limit relations hold for any 1 <1,7 < n,t Æ J: 


lim P(x") < €) = lim P(z{”/z}” <€) 
= $+ Aarctan € = Ga(€, 1,0). (1.0.2) 


It turns out that an analogous limit distribution arises if we consider the 
joint distributions of any finite number of components of the solution Tn. 
Namely, under the same assumptions 


lim P(x’) < &1,...,26") < &) 
n—- co 


— „-(k+1)/2p (* + *) J (1 + ful2)-@+))/2 du 
2 uj <j 
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for any fixed k, i.e., the limit law here turns out to be a k-dimensional Cauchy 
distribution. 

It should be mentioned that the Cauchy distribution (both one-dimensional 
and multidimensional) occurs in various problems more frequently than other 
stable laws, yielding only to the normal law in this respect. The special 
position of the Cauchy law, like that of the normal law (+-+), can be observed 
also in the analytic setting. 

To these laws we could also add the Laplace distribution, assigning the 
value a = 0 to it conditionally (there are well-known grounds for this; see, for 
example, the relation (2.9.1)). The symmetric laws with integer values of a 
thereby distinguish themselves from other stable laws by their significance. 

The Lévy law with a = 1/2, = 1, and y = 0 can also occur fairly often 
in problems. Both the Cauchy law and the Lévy law are closely connected 
with the normal law. This is clear from the fact that the ratio N;/N2 of 
independent random variables with the standard normal law has a Cauchy 
distribution, while the random variable N; * has a Lévy distribution” (in the 
final analysis, it is the first circumstance that explains the appearance of the 
limit relation (1.0.2)). 

There are more than a few examples in which the mechanism for the ap- 
pearance of the Cauchy law reminds us little of our usual mechanism for 
summation of random variables. We illustrate this by another example taken 
from [22]. 

EXAMPLE 4. Consider a point source of radiation on the plane of a foil 
located parallel to a screen at a distance equal to 1. The radioactive emission 
causes point flashes of light on the screen. Our problem is to compute the 
distribution of these flashes on the screen. First of all, choose a Cartesian 
system of coordinates (zx, y, z) in space such that the (x, y)-plane coincides with 
the plane of the screen, while the radiation source has coordinates (0,0, 1). 

The points at which we see flashes are randomly located on the plane. 
Denote the coordinates of one of these flashes by (U,V,0). Of course, the 
geometric symmetry in the very statement of the problem implies that the 
random vector (U,V) has a circular distribution, for the determination of 
which it suffices to find the distribution of one of its coordinates, say U. With 
this aim we project the whole process on the (z, z)-plane. In the planar model 
the radiation source has coordinates (0,1), and the random positions of the 
flashes on the z-axis have the coordinate U. 

Denote by F(x) the distribution function of the random variable U; F(z) is 
the probability that a ray hits the axis to the left of the point with coordinate 


*These facts follow from relations in Chapter 3. 
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(z,0). All the rays satisfying this condition are included in an angle equal to 
t/2 + arctan z, measured counterclockwise from the line z = 1. All the rays 
that can hit the screen are obviously included in the angle 7. Moreover, it 
is not hard to see that the angle p corresponding to the random point (U,0) 
has a uniform distribution between the limits 0 and 7. Consequently, 


F(z) = 5 + +arctan z = oY a L0), 
i.e., the component U (as well as V) has a Cauchy distribution. 
Since the vector (U,V) has a circular distribution, it coincides with a two- 
dimensional Cauchy distribution. 
The distributions in the Cauchy family having densities of the form 


ga(a,1,0,A~*4, A) = ZA 4 (r-a) (1.0.3) 


play an important role in contemporary physics. Distributions of this type are 
known under their own names in molecular physics, atomic physics, nuclear 
physics, and the physics of elementary particles. 

For example, the energy distribution for unstable states in the reactions 
of decay products is called the Lorentz law, while the mass distribution for 
the particles taking part in the same reactions is called the Breit-Wigner 
distribution. 

The mechanism for the appearance of the Cauchy distribution in quantum 
mechanics in the description of unstable systems is very specific and is not 
connected in any apparent way with the common mechanisms leading to stable 
laws. In our view, it is useful for experts in probability theory to have an idea 
of how this mechanism acts. We briefly present arguments leading to the 
appearance of the Lorentz distribution. 

In quantum mechanics the state of an unstable physical system is described 
by a so-called wave function |y(t)) that has vector values and is a solution of 
the time-dependent Cauchy problem for the Schrodinger equation 


HWW) =i), YO) = Io, (1.0.4) 


where H is the Hamiltonian operator (Hermitian operator) corresponding to 
the system, and |) is a given initial value of the wave function.* 

Let {\~ez),|~~)} be a complete system of eigenvectors of the operator 
H (\~e) corresponds to the absolutely continuous component of its spectrum, 
and |y,) corresponds to the discrete component), i.e., 


H\pz)=Elye), (ve lpr) = 6(E' - E), 
H|pk) = Exlex), (pele) = xt, 


*Here we adhere to the notation common in contemporary theoretical physics. The 
inner product of vectors |a) and |b) is denoted by (a|b). 
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where 6(E’ — E) is the Dirac delta function, and ôx, is the Kronecker symbol. 

We shall be interested in the probability P(t) that at time t the system finds 
itself in the initial state |). According to the rules of quantum mechanics 
[104], 

P(t) = |(dol(t))/?. 

In solving the Cauchy problem (1.0.4) for the Schrodinger equation, we can 
assume that (polyo) = 1. In this case the Fock-Krylov theorem [116] gives us 
that 


f(t) = (volv(t)) 
= J Jex|? exp(—iErt) + J Ic(E)|2 exp(-iEt) dE, (1.0.5) 
k 0) 


where c and c(E) are the Fourier coefficients in the expansion of the vector 
lpo) in the complete system {|p E), |pk)} of eigenvectors: 
(0 9) 
Ivo) = X crl) +f c(E)|pe) dE. 
k 

Thus, f(t) can be interpreted as the characteristic function of some distri- 
bution having discrete components (probabilities of isolated values) |c,|? and 
absolutely continuous component (i.e., density) |c(E)|*. Instability of the sys- 
tem means that the probability P(t) = |f(t)|? of the system returning to the 
original state at time t tends to zero as t — oo. 

Since f(t) is a characteristic function, | f(t)| — 0 only if the discrete com- 
ponents of the spectrum of H are missing, i.e., cg = 0. In this case 


fös i “uian -ADOte. (1.0.6) 


where w(E) = |e(E)|? signifies the density of the energy distribution of the 
decaying physical system described by equation (1.0.4). 

It turns out that for a very broad class of unstable physical systems the 
densities w( E) are meromorphic functions (see {116]). For a number of reasons 
the case of a function w( E) having only two simple poles (they are complex 
conjugates in view of the condition w( E) > 0) is of greatest interest. In this 
case it is obvious that 


w(E) = A(E — Ep)? +r E>O0, 


where A is a normalizing constant, and Ep and T are the most probable value 
and the measure of dispersion (with respect to Eo) of the system’s energy. 
For real unstable systems* the ratio [/Ep is very small as a rule (10715, 


* An example of such a system is a neutron with an average lifetime of 18.6 min. decaying 
at the end of its lifetime into a proton, an electron, and a neutrino (n > p +e + v). 
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or even smaller). Therefore, to compute P(t) we can without appreciable 
detriment replace the lower limit 0 in the integral (1.0.6) by —oo, after which 
the density function w( E) and the probability P(t) connected with it have the 
approximate expressions 


w(E) x —[(E — Eo)? + r°}, 
P(t) = |f(t)|? ~ exp(—2It). 


k 
T 


It is clear from the first relation (the Lorentz distribution of the energy of 
the unstable system) that we are dealing with a Cauchy distribution, and it 
is clear from the second relation that the lifetime for unstable systems of the 
type under consideration has an exponential law. 

Thus, the Cauchy law appears here only as a more or less good approx- 
imation of the real energy distribution for unstable systems. And there are 
situations when the replacement of 0 by —oo in (1.0.6) is unacceptably coarse, 
because the corresponding law P(t) of decay of the system differs essentially 
from an exponential law (concerning this see {115]). 


§1.1. A model of point sources of influence 


The mathematical model with which we shall become familiar in this sec- 
tion is of a fairly general character, and certain types of problems from areas 
of astronomy, physics, etc., turn out to be connected with it. Of course, 
in the framework of a general mathematical model the solutions of practical 
problems most likely cannot claim great accuracy in transmitting the prop- 
erties of the phenomena under study. However, as a first approximation they 
can nevertheless turn out to be useful for understanding the features of these 
phenomena and for directing the further efforts of investigators (1:7). 

We consider some region U of finite or infinite volume in n-dimensional 
Euclidean space R”; in concrete problems it can be a region of physical space, 
space-time, phase space, etc. Dispersed in U is a countable system % of point 
objects which we conditionally call “particles”. The particles are characterized 
by the value of a parameter 0 taking values in some set ©. 

Each of the particles in % is capable of creating a “field of action” (for 
brevity, simply a “field”) described by means of an “influence function” 
v(z, y, z); v(z, y, z) takes values in some subset V of m-dimensional Euclidean 
space R™ and shows the value of the field created at a point y E€ U by a 
particle located at a point z € U and characterized by the value @ = z. By 
definition, a finite collection A = {X;} of particles in A, with the particle 
indexed by i located at the point X; and having value Z; of the characterizing 
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parameter 6, creates a field with influence function equal to 
S-{u(Xi,y, Zi): Xi € A}, (1.1.1) 

i.e., a field equal to the vector sum of the fields created by the separate 
particles in the set A. 

Both the positions X; of the particles in 2 and the values Z; corresponding 
to them are assumed to be random variables subject to a number of conditions. 

Let U; and U2 be some disjoint regions in U of finite volumes u, and 
uz. A countable system of randomly situated points in U is called a Poisson 
ensemble (see [22] concerning this concept if it has the following properties. 

1°. The numbers N; and Nz of points falling in U, and U2, respectively, 
are independent random variables. 

2°. For any k > 0 the probability P(N; = k) depends on k and the volume 
u1, but not on the shape of U1. 

3°. For small values of the volume u1 


P(N; = 1) = pu + o(u1), 

P(N, > 2) = o(uy), 
where p > 0 is a constant signifying the average density of the concentration 
of points of the system in U. 


By using these properties it is not hard to show that N, has a Poisson 
distribution with parameter A = pu}, i.e., 

P(N, =k) = Af exp(—A), k=0,1,.... (1.1.2) 

We mention two more properties of Poisson ensembles of points needed in 
what follows. 

4°. Let X,,...,Xy, be the positions of the ensemble points falling in Uj. 
Then the random variables X1, X2,... and N; are mutually independent. 

5°. The conditional distribution of the position of an ensemble point in U: 
under the condition that it falls in this region is uniform, i.e., has distribution 
density equal to 1/u1. 

We make the following three assumptions about the system 2 of particles: 

a) A is a Poisson ensemble (and, consequently, has properties 1°-5°). 

b) For any region U, of finite volume the number N; of particles falling in 
Uı, the positions X1, X2,... of these particles, and the values Z1, Z2,... of 
the parameter @ characterizing them are independent random variables. 

c) The random variables Z1, Z2,... have the same distribution law Po. 

For each region U; of finite volume the field created by the particles falling 
in U, is random, of course, and, according to the rule (1.1.1), its value at a 
point y € U is 


vı = u(y) = $ {v(X;,y, Z;): X; € Ui}. (1.1.3) 
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The field generated by the whole system 2 will be understood as a certain 
limit. With this goal we consider in R” the set of spheres Sr = Sr(y) with a 
common fixed center at a point y € U and varying radii T > 0. Then we form 
the sets Ur = Ur(y) = UNSr(y),T > 0, and denote by ur their volumes, and 
by Nr the number of particles of 2 falling in Ur. These particles generate a 
field which, according to (1.1.3), has at y the value 


vr = vr(y) = X lX; y, Z;): Xj € Ur}. (1.1.4) 


By definition, the value v(y) of the field generated at y by all the particles 
in A is equal to the weak limit of vr(y) as T — oo. To determine conditions 
for the existence of this limit and to find an analytic description of it we 
can consider the question of the limit value as T — oo of the characteristic 
function of the variable vr(y): 


fr(t) = fr(t,y) =Eexp{i(t,vr)}, teR™. 


With this aim let us compute the function f7(t) by using our assumptions 
a)-c). It is clear from (1.1.4) that vr(y) is the sum of a random number 
Nr of identically distributed random variables W; = v(X;,y,Z;) that are 
independent of Nr and of one another. Consequently, denoting by yr(t) the 
characteristic function of W1, we can write 


fo(t) = S.P(Np = k)Eexpfi(t,Wi +--+ We)} 
k=0 
oo k 
=e exp(—pur) FET oht) 


k 
= exp{pur(t) — 1)}. 
We transform the expression in the exponent of the last equality: 
dx 


ied teal) i= J (ei(tv(eu2)) — 1) P (d2) 
Urxe UT 


=p | (exptilt,v(z,y,2))} ~ 1) de Po(dz). 
Urxe 
Thus, the expression 


~T Í 


Ur x 


seems — 1| dx Po(dz) (LLS) 


is obtained for log fr(t); it can be transformed further by taking the function 
v = v(z,y,z) as a new variable and forming the measure 


ame of ETIE (1.1.6) 
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on R™, where B is a Borel subset of V C R™ and B, = {(z,z): v(z,y,z) € 
B}. The introduction of this measure clearly presupposes the measurability 
of the function v with respect to the pair (z, z) for each y € U. 

As a consequence of (1.1.5) and (1.1.6), 


onfe = [ lexp{i(t, oJ} = 1] u(dv), (1.1.7) 


where Vr = {(z, z): v(x, y, z) € Ur x O}. 

Thus, the function f7(t) obtains the canonical Lévy representation for char- 
acteristic functions of infinitely divisible distributions on R™. The question 
of the convergence of the integral in (1.1.7) does not arise, since u(Vr) = pur, 
where ur is the volume of Ur. Since the set Vr is clearly nondecreasing as T 
increases, the question of the existence of the limit of fr(t) as T — oo reduces 
to the question of the existence of the limit of the integral 


n [ ernie =u. (1.1.8) 


Here several essentially different situations can arise. Denote by /(A) the 
indicator function of an event A, and by Lr the integral (vector) 


f vi (|v| < tjaldi), 
Vr 


which obviously exists for all T > 0. Let us write the integral Jr in the form 


Jr = i(t, Lr) +f (ete) — 1 — i(t, v)I(\v| < 1)) u(dv). 
VT 

In the second term the integration over Vr can be replaced by integration over 

V if we simultaneously replace the measure u by the measure ur coinciding 

with u on Vr and equal to zero outside this set. Furthermore, ur — yp as 

T — œ, because Vr is nondecreasing as T increases. 

Since log fr(t) = Jr, fr(t) converges to some characteristic function f(t) 
as T — œ if and only if (this fact follows from the general theory of infinitely 
divisible laws in R™) 

1) Lr > L as T > ~, where L is a vector in R™, and 

2) D = fy min(1, |v|?) u(dv) < oo. 

Under these conditions the characteristic function f(t) of the limit distri- 
bution has the form 


log f(t) = i(t, L) + [wee — 1 — i(t, v)I(\v] < 1)) w(dv). (1.1.9) 


We indicate two cases when (1.1.9) takes a simpler form. 
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1*. if 
D; -/ min(1, |v|)u(dv) < oo, 
V 


then conditions 1) and 2) are satisfied and 
log f(t) = [ oito) — 1)p(dv). (1.1.10) 
2*. If condition 1) holds along with the condition 
Py = J ma, |v|)|v|u(dv) < oo, 


which clearly implies property 2), then 


toe fie) atl, Mf) + [ (ate). — fe, what dbl, (1.1.11) 

where 
M=L v u(dv). 1.1.12 
+ i (dv) (1.1.12) 


In the case we are considering the limit value L does not necessarily have an 
integral form of expression, without which M cannot be interpreted as the 
mean value of the limit distribution. If, nevertheless, such a representation of 
L exists, then (1.1.11) is easy to transform. 

As a matter of fact, the violation of condition 2) excludes the possibility 
that the limit distribution of vr(y) exists, even after this variable is also 
centered. Therefore, we consider the situation when condition 2) holds but 
condition 1) does not. In this case it makes sense to study not the random 
variable vr itself but its translate Dp = vr — Lr. The corresponding weak 
limit Č of this variable as T — oo exists if and only if 2) holds, and the 
characteristic function f of the random variable č under condition 2) has the 
form 


log f(t) = Lie —1-—<i(t,v)I(\v| < 1))u(dv). (1.1.13) 


If the condition Dz < o also holds, then vr can be translated by the 
mathematical expectation Mr = Ev7, which is finite in our situation. With 
this translation the random variable Dr = vr — Evr converges weakly to the 
random variable with characteristic function f of the form 


log f(t) = [ew — 1 -1(t,v))p(dv). (1.1.14) 


The various translations of vr are carried out in the cases when vr itself 
does not have a weak limit, i.e., in the cases when the field created by the 
whole system % of particles simply does not exist. Nonetheless, a practical 
meaning can be given to the above construction of limit approximations of the 
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distributions of the random variables yp and vr. The fact of the matter is that 
the appearance of a region U of infinite volume in the model is a mathematical 
idealization. The region U has in reality a large but finite volume. In this 
case we study random fluctuations of the field generated by the particles in 
U, with a certain constant component singled out in it. After singling out 
the constant component we can simplify the computations by an appropriate 
expansion of U, provided, of course, that this does not introduce essential 
distortions into our calculations. 

The limit distributions (1.1.9)-(1.1.11) and (1.1.13)-(1.1.14) considered 
above are infinitely divisible. It is natural to expect that in certain cases 
we shall encounter m-dimensional stable laws; this is entirely determined by 
the form of the measure defined by (1.1.6), which, in turn, depends on the 
properties of the influence functions v(z,y,z) and, to a lesser degree, on the 
properties of the distribution Po. 

Below we single out a class of influence functions generating stable distribu- 
tions. It is connected with transformation of formula (1.1.10), accompanied 
by the condition Dı < oo, and of formulas (1.1.11) and (1.1.14) obtained 
under the condition D2 < oo. 

A semicone (with vertex at the origin) in the subspace R™ of R” (0 < nı < 
n) is defined to be a set U; such that if u, € U; and c > 0, then cu; € U; and 
—cu, ¢ U,. With this definition the following assumptions are made about 
the set U and the points y for which the distributions of the random variables 
v(y),v(y), and p(y) are to be computed. 

(i) The set Uy = {u: u = x — y, x E€ U} either is itself a semicone in R” or 
is a direct product U; x U2 of semicones U; and U2 in orthogonal subspaces 
R”! and R? ni + na=n. 

The first of these two cases obviously reduces to the second if the dimension 
nə is allowed to take the value nz = 0. This convention will be followed below. 

The decomposition of R” into orthogonal subspaces corresponds to the 
notation z = (£1, 22) for vectors z € R”, where x; E€ R™ and z2 € R™. 

(ii) The influence function v(z, y, z) is 0 for all x ¢ U, is continuous with 
respect to the variable x at all interior points of U, and has the following 
structure there: 


olz, y 2) = |z1 — yi| ?D(|z2 — ller — 1 |%, '), (1.1.15) 


where the symbol “.” indicates the dependence of D on y,z and sj = 
(x; — y;)/|x; — y;|,7 = 1,2, or on some of these variables. 

The condition in (i) that the U; be semicones could be extended some- 
what by allowing U; to be both a semicone and a cone with vertex at the 
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origin. However, this extension turns out to be connected with only one pos- 
sible point y if U; is not the whole subspace R™. Anyway, the last case is 
encompassed by the assumption (i) if R"™ is regarded as a limiting case of 
an expanded semicone; therefore, the indicated extension involves only a very 
special situation. 

Further, note that in the case when U; = R”™ the influence function v 
must be invariant with respect to a translation by the variable zj, i.e., it 
must depend only on the difference z; — yj. 

In what follows, the numbers p > 0 and q > 0 will be connected with the 
dimensions nı and ng of the subspaces by additional conditions. 

We consider (1.1.10) with the condition Dı < oo, after returning to the 
original variables x and z: 


log f(t) = of Ae — 1) dz Po(dz), (1.1.10*) 


Dı = p | min(1, |v|) dx Po(dz) < oo. 
Uxe 


In the integral (1.1.10*) let us pass to a polar system of coordinates in the 
subspaces R” by setting (7 = 1, 2) 


rj = |z; — yl, a a ae he E 


where s; is a point on the surface of the unit sphere S; about the origin. We 


have that 
Ww = | en -1)dz= f J wə ds2 dsj, 
U Sı JS2 


w2 = J| otit rD, JY — 1r rri dre dri. 
0 


where 


We make a change of variable, replacing r2 by ror7, and, after changing the 
order of integration, get that 


oo 0O 
wo = J me dra | [exp(zér, ”) — irre? dri, 
0 0 


where £ = (t, D(ra,-)). 
Substitution of rı for the variable r;” gives us that 


1 oo Co , 1 
Wo = “| raat ar | (esi — ie dry. 
0 0 


If 
a= (nı + qn2)/p < 1, (1.1.16) 
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then the inside integral in the expression for w2 converges, and 


/ (e£ — 1p"! dry =T(-0)|£l® exp (—iDasgn €) , 
2 


which implies that 
r(- ad T mgr 
wa = EO) Mle, s)| exp (-iGasen(t,s)) IDI®r3?= dra, 
p 0 2 
where s = D/|D|. 
Let us introduce a measure x on the unit sphere S in R” by setting 


“i Di} = meal y |D(re, -)|* r7 dra ds; dsz Pg(dz) (1.1.17) 
ap sEB 


for any Borel subset B of S (here the integration is also over all r2 and z). 
Considering the expressions obtained for wı and w2, we can transform the 
right-hand side of (1.1.10*) to the form 


log f(t) = — | |(t,s)|* exp 4 -i~asgn(t, s)  x(ds), (1.1.18) 
S 2 


which corresponds to the canonical form of expression for the characteristic 
function of an n-dimensional stable law with main parameter 0 < a < 1. The 
finiteness of the complete measure x(S) is an additional condition present in 
the description of the n-dimensional stable laws. It is not hard to verify on the 
basis of (1.1.17) that the condition x(S) < oo is equivalent to the condition 
Di < 0d, 

The second case is connected with a transformation of the integral 


w = / (ett) — 1 — a(t, v)) (dv) 
V 


in (1.1.11) and (1.1.14) under the condition D2 < oo. Repeating the same 
arguments used above, we have that 


= o, Pa(dz) | (et) — 1 — i(t, v))dz 


U 
=p | Pa(de) | J wə dso dsj, 
(S) Sı J S2 


wW = J| lotit, ri” D) — 1 — i(t, r” D) ri r dre dry. 
0 


where 


After substitution of rar? for ro the quantity D becomes independent of r1, 
and after substitution of rı for r] ” the integral w2 is transformed to 


oo co 
Wo cs j panmi ara | Tope iêri)ry e}? dr}, 
P Jo 0 
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where € = (t, D(r2,:)) and a = (nı + qn2)/p. If 1 < a < 2, then the inside 
integral converges and can be computed. It turns out to be equal to 


I'(—a)|€|* exp(—75 a sgn £). 


Consequently, introducing the measure x just as in (1.1.17) but with the 
opposite sign, we arrive at the following expression for w = log f(t) — i(t, M) 
and w = log f(t): 


w= f \(ts)|* exp {—igasen(t,s)} x(ds). (1.1.19) 


Finiteness of the complete measure x(S) is equivalent to the condition 
Dz <00; 

The cases selected above are among the simplest ones. The stable laws 
are obtained also on the basis of formulas (1.1.9) and (1.1.13), since in the 
final analysis the question of whether or not the limit distribution is stable 
is answered only by the form of the spectral measure ju(dv), which is directly 
connected with the form of the function v(z, y, z). Laws with the value a = 
(nı + qn2)/p = 1 also appear in these complicated cases. 

REMARK 1. We mention that the field created by a set of particles is not 
necessarily homogeneous and isotropic. What kind of field this is depends 
on the structural characteristics of the influence function. Therefore, in the 
general situation the stable distributions computed for the value of the field 
v(y) at a point y or the value of the previously centered field >(y) depend on 
y, even though the main parameter a of these distributions does not change 
as y changes. 

The distributions v(y) and v(y) do not depend on y in the case when the 
influence function v has the property of homogeneity within the set U: 


v(z,y, z) = v(x — y, 0, z). (1.1.20) 


This should be understood in the sense that v(x + y, y, z) does not depend on 
y within the confines of the semicone U, x U2, which also does not depend on 
y. In this situation any interior points of the original region U can be chosen 
in the role of the points y € R” satisfying conditions (i) and (ii). 

REMARK 2. The assumption (1.1.15) about the structure of the influence 
function can be generalized as follows. 

Let R” = R™ x -x R™*, where 0 < ny <n and n; > 0, 7 =2,...,k. In 
this case each vector x € R” can be written in the form z = (z1,..., £k), £j € 
mn, 

Assume further that U, = U; X --- X Uk, where the U; are semicones in 
R", and that the influence function v is equal to zero outside U and can be 
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expressed inside U in the form 


v(x, y, z) = |z — y| P D(|z2 — yo| |z1 — yl 2 -- + |e — Ye [21 — yal ™*,*). 

(1.1.21) 
Here p > 0,q; > 0, and the symbol “-” indicates (as in (1.1.15)) dependence 
on the collection of variables y,z, and s; = (£; — y;)/|zj — yjl,j = 1,..-,k 


(or some of them). 

Just as in the case k = 2 singled out in (1.1.15), the distributions of the 
random variables v(y),/(y), and D(y) in the case (1.1.21) turn out to be stable 
with the main parameter 


a = (nı +q2Nn2 + -© + qkNnk )/P. 


The course of the arguments is somewhat more complicated than in the case 
of (1.1.15), but is basically the same. 

REMARK 3. Assume that the function D in (1.1.15) has the following 
structure: 


D(|z2 — ye|/|z1 = 1l") 
= p(|z2 — yal/|z1 — yil?, y, 2)Y(51, 82,9), (1.1.22) 


where ¢ is a real function, s; = (x£; — y;)/|z; — y;|, and 


w(—s1, $2, y) = —w(s1, 82, Y) or w(si, — s82, y) = —w)(s1, 82, y). 


Under the corresponding conditions (i.e., the condition Dı < oo for v and 
the condition Dz < oo for D) the variables v(y) and v(y) in this case have 
spherically symmetric distributions with characteristic functions of the form 


exp(—A|t|*), 0.< a < 2, 


where a = (nı + qn2)/p, and A is a constant depending on v and Pp. 

REMARK 4. The question of which influence functions lead to stable distri- 
butions in the model of point sources of influence is apparently not exhausted 
by functions of the types (1.1.15) and (1.1.21). It is therefore of interest to 
look for other types of functions v which generate stable laws. Another in- 
teresting direction for investigation has to do with the desire to weaken the 
original assumptions a)-c) in the general model. The following special case 
shows that the search in this direction is promising. 

Let 


v(z,y, z) = |zı — y1 |? D(z — y2, `), p> 0, (1-123) 


where the symbol “-” denotes dependence on s1, $2, y, and z, and let conditions 
b) and c) be replaced by the following conditions: 
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b*) For any region U, of finite volume the number N; of particles falling 
in U, is independent of both the positions X1, X2,... of these particles and 
the values Z1, Z2,... of the parameter 0 characterizing them. 

c*) The pairs (X1, Z1), (X2, Z2),... of random variables are independent 
and identically distributed. 

In the setting of the new conditions we suppose that the joint distribution 
P(dz,dz) has the structure 


P(dz, dz) = dz,Q(dz2, dz). (1.1.24) 


It turns that in the case when the conditions (1.1.23) and (1.1.24) are in 
effect the distribution of v(y) and v(y) is stable with parameter a = n,/p. 
The functions log f(t) in (1.1.10) and log f(t) in (1.1.14) are computed by the 
same scheme as used above. The parameter a = nı/p varies in the interval 
(0, 1) if the condition Dı < œ is used, and in (1,2) for D2 < o. 

If, by analogy with (1.1.22), 


u(z, y, z) = [£1 z yı| ?o(|zo Si y2|, y, z)Y(s1, $2,4), (1.1.25) 
where ¢ is real and wy is such that 
p(—s1, 82, y) = —(81, 82, y) or p(s, —82, Y) = —Ņ(81, 82, Y), 


and 
Q(dr2, dz) = Qı (dr2, dz) dso, 


then the distribution generated by v is spherically symmetric and stable, with 
parameter varying between 0 and 2. 

We give some examples of concrete applied problems. 

EXAMPLE 1. The gravitational field of stars. Holtsmark [114] considered 
the problem of the distribution of the gravitational effect of stars of various 
masses uniformly scattered in space with density p, on a unit mass at a cho- 
sen point (this problem was mentioned in the Introduction). The system of 
assumptions used by Holtsmark corresponds completely to the conditions in 
our model. Choose U = V = R?, O = R* = (0,00) and 


v(z, y, z) = Gz(z — y)/|z —y|°. (1.1.26) 


The function v expresses the familiar Newton law of attraction and gives the 
force with which a body of mass z located at a point x attracts a unit mass 
located at a point y; G is the gravitational constant. 

The form of v corresponds to the case (1.1.22). The condition Dz < œœ 
holds if it is assumed that h = E63/? < oo. Since the form (1.1.22) of v 
ensures the equality M = 0 in (1.1.11) because of oddness with respect to the 
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variable sı, we get (according to (1.1.19)) after some computations that the 
gravitational field is homogeneous and 


log f(t) = -Alt = -E phare, 


i.e., the total effect v at any fixed point p all the stars has a spherically 
symmetric distribution with a = 3/2 (1/3). 

Along with the value of the force v of the gravitational effect of the system 
of stars on a unit mass at zero, we can consider also the rate of change of v 
with time, which we denote by v’. The equality 


u= Sv Xi, 0, Zi) = Gy) Z: X| X: 


implies that 
p= G) Z{Y;|Xi|-? — 3X{Xi; YIX}, 


t 
where Y; is the rate of change of X;; therefore, making the natural assumptions 
that the triples (X;, Y;, Zi) of variables are mutually independent and X; does 
not depend on the pair Y;, Zi, we get the situation described in Remark 3 in 
the form of the conditions a), b*), and c*). 
Let U = R, U; = R3, Uz = R3, V = R, and O = R+. Here we choose the 
influence function 


v(z,y, 2z) = |z1 — y|’ p(|z2 — yal, z)¥(s1, 82), 


where 9(r, z) = Grz and (81, 82) = 82 — 381(81, 82). The joint distribution 
of Yı and Z; is denoted by P(dz2,dz). The form of v shows that it is a special 
case of (1.1.23). 

It follows from physical considerations that this distribution is spherically 
symmetric in the first argument, i.e., P(dr2,dz) = Q(dr2, dz) ds2. 

The situation we have described is easily seen to be a special case of the 
situation discussed in Remark 3. We have that p = 3 and nı = 3, whence 
a = 1. Consequently, the rate of change v’ of the force at any point of space 
has a stable distribution with characteristic function of the form exp(—A\|t]), 
i.e., Vv’ has a three-dimensional Cauchy distribution. Simple computations 
show that \ = —1tGkh/2, where 


Rix De rzQ(dr, dz) 


k = AE A |(e, %(s1, $2))|ds, ds2, é = (1,0,0). 


and 
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EXAMPLE 2. Temperature distribution in a nuclear reactor. The problem 
is as follows. In a body D with boundary T the action of radiation at random 
times 71,72,... and at random places Y;,Y2,... causes transient but very 
intense temperature flashes connected with nuclear transformations. Because 
of propagation of heat in the body, each flash arising at a time 7; and at a 
point Y; leads to an increase in the temperature at time r > 7; at a chosen 
point a € D by an amount L(r — Ti, Y;,a), where the function L(x 1, 272,a) is 
a solution of the heat conduction equation 

2 = AL + 6(a — r2) (z1) (LL27) 
Ox 1 
with boundary condition L + bðL/ðn|r = 0. Here A is the Laplace operator 
with respect to the variable a, and (x) is the so-called Dirac 6-function 
(the generalized function 6 can be taken to be the density of the degenerate 
distribution concentrated at the point x; see [81] concerning solution of this 
equation). 

The temperature T(r,a) at a point a at time r is composed of the contri- 

butions by the flashes that have occurred up to this time, i.e., 


(fa) = S {L(r -N fpl % < 7}. 


It is not hard to see that the problem is in complete correspondence to 
the conditions taken as the basis of the model of point sources of influence if 
the natural assumptions a)-c) are made about the pairs X; = (ri, Y;), with 
U, = R*t,U = D,U = U, x Ue, and V = Rt. If we assume that the 
flashes occur because of decay of atoms of a single type, then the set © can 
be taken to be empty, i.e., the function L can be taken to be independent of 
the parameter 0 characterizing the atoms. Otherwise, L must be assumed to 
depend on this parameter. We analyze the first case, which is the simplest. 

The fact that the solution L of (1.1.27) does not include physical charac- 
teristics of the material of the body being irradiated such as the coefficient of 
thermal conductivity, the energy emitted in the process of a single flash, the 
heat capacity of the substance, etc., is explained by the special choice of the 
units of measurement: length, temperature, and time; this choice allows us to 
treat the problem in a dimensionless setting (for details see [117], where this 
problem was first considered). 

Since we are assuming that the collection of pairs X; = (ri, Yi) is a Poisson 
ensemble subject to the conditions b) and c), the answer to the question of 
the distribution of the random variable T(r,a) connected with a body D of 
finite volume is simple. According to (1.1.5), the characteristic function fp(t) 
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of this random variable has the form 
log fp(t) =itMp + | J (e*} — 1 — itL) day dro, (1.1.28) 
DJO 


where p is the average density of the pairs X; in the space-time set Dx R”, L= 
L(x, 22, a), and 


Mp =e f L(x, £2, a) dx, dz2. 
pD Jo 


In studying the stochastic regularities of the variation of the temperature T 
at various space-time points (a,r) it is natural to exclude from T the constant 
component Mp and consider the quantity vp = T — Mp. 

At the point (a,r) we change the time origin, replacing r by r +w, and let 
w go to oo; this obviously corresponds to a steady-state mode. Let vp denote 
the weak limit of the variables vp under this limit. According to (1.1.28), vp 
has characteristic function 

log fp(t) = of (e+ — 1 — itL) dx dz. 
DXR? 
Of course, the distribution of Pp is now independent of both a and r. 

Subsequent success in computing fp(t) is entirely dependent on whether 
we can find the explicit form of the influence function L. 

We consider the case of a body D large enough so that it can be re- 
placed in the computations by the whole space R? without great detriment. 
Here it is natural to take the boundary condition for equation (1.1.27) to be 
L(x1,22,a) > 0 as |a| — oo. A simple verification shows that in this case the 
function 
L(21,22,a) = (4mz,)~?/? exp{—|z2 — al? /221} 
is a solution of (1.1.27). We let z = (21,22) and v(x) = L(21, 22,0), and 
compute the characteristic function f(t) corresponding to D = R. We have 
that 


log f(t) = o| J (e! — 1 — itv) dz; dao. (1.1.29) 
R3 J R+ 


It is not hard to see that this is a situation corresponding to (1.1.14) with 
influence function of the form (1.1.21). Furthermore, p = 3/2, q = 1/2,n; = 1, 
and nz = 3. Consequently, the distribution of v is stable, with parameter 
a= 57/3. 

Let us carry out a more detailed computation of (1.1.29). With this aim 
we compute the measure u(dv) = dz, dr2, taking w = v(u) as a new variable. 
We have that 


Pi} = / dz, dxr2 = an | r2 dz, dro. 
vow v>w 
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A corresponding change of variables gives us that 


P(w) = wo f 4nr2 dro dz, = kw~*/3, 
>l 

The constant k can be computed because the integration over the region 

((r2,£1): v > 1) reduces to iterated integration. Namely, 


A B A 1 3 3/2 
k= ar | dz, f rs dry = Tür (3) ’ 


where A = (47)~! and B = (—6z) log(4mz,))~!/?. Thus, 
u(dw) = Š kw 8/3 dw. 


The subsequent computation of (1.1.29) is not difficult. In summary, v has 
a one-dimensional stable distribution with characteristic function 


log f(t) = 3T (4) pk(—it)*/%, 


which corresponds in the form (B) to the values 


5 s fay" 71 


EXAMPLE 3. Distribution of stresses in crystalline lattices. As is well 
known, crystalline structures distinguish themselves by a rigid geometric ar- 
rangement of their atoms. However, this idealized representation of crystals 
turns out to be valid only in very small parts of them. Real crystals al- 
ways have various disturbances in their structure, either because there are 
sometimes extraneous atoms at a prescribed location, or because there are no 
atoms at all at these locations. 

Such anomalies in crystalline lattices are called dislocations. They may 
be scattered in the body of a crystal, but they may also be concentrated, 
forming lines and even surfaces with complicated configurations. We consider 
the case of point dislocations uniformly scattered in the body of a crystal 
with average density p. Each of the point dislocations creates an additional 
stress at the nodes of the crystalline lattice. This stress is described by a 
stress tensor v = (vx; k,l = 1,2,3; k #1) depending in general on the point 
y of the dislocation, on the point x at which the stress is being considered, 
and on the type of dislocation, which is given by the value of some parameter 
6. In many concrete cases the tensor v(x, y,4), which without damage to the 
essence of the problem can be regarded as simply a 6-dimensional vector, has 
the following form (at least for points x and y far from the surface of the 
crystal): 


v(z,y,z) =|z—y|°D(s,z),  s=(z-y)/|x—-yl, (1.1.30) 
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where the vector D(s, z) has the property 
D(s,z) = —D(-s, 2). (1.1.31) 


The total stress at y created by point dislocations located at points X; and 
having corresponding types Z; is equal to the vector sum >, U(XG, y, Zi)- 

It is completely obvious that here there is a basis for using the model of 
point sources of influence. Indeed, U is the body of the crystal in R?, and 
the point dislocations X; in U form the system 2 of particles that can be 
assumed to satisfy conditions a)-c) of the general model (familiarity with the 
specialized literature (see [139] and [108]) corroborates the possibility of this. 
The “action” of the individual dislocations is described by a single influence 
function v(z,y,z) with values in some set V C R®. The form of v coincides 
with (1.1.15) (p = 3,ny = 3,n2 = 0), and its property of central symmetry 
allows the use of (1.1.10) with the value M = 0 for computing the total effect 
v of the system 2. 

In summary, v has a stable centrally symmetric distribution in R® with 
the value a = 1: 


log f(t) = - J i(t, v)|x(dv), (1.1.32) 


where 


eh aoe J ID(s, 2)| ds Pp(dz), BES. 
6 JD/ID|€B 


This distribution is a nonspherical analogue of a six-dimensional Cauchy 
distribution. Its projections on the coordinate axes are ordinary Cauchy dis- 
tributions (with different scale parameters À in general). We remark that 
[141] contains a solution of this problem on the level of computation of the 
distributions for the coordinates of the total stresses. 

EXAMPLE 4. Distribution of the magnetic field generated by a network of 
elementary magnets.* The following scheme is encountered in certain physics 
problems. 

Actually, we consider here not one but two examples. However, they both 
involve computing the distribution of a magnetic field in the space U = R3. It 
is generated by the combined and independent action of a system of elemen- 
tary (point) magnets randomly dispersed in U with average density p. This 
system is assumed to be a Poisson ensemble satisfying conditions b) and c). 


“My attention was directed to this interesting example of an application of the model 
of point sources of influence by D. Vandev and N. Trendafilov, who are colleagues at the 
Institute of Mathematics of the Bulgarian Academy of Sciences, and I am very grateful to 
them for this. 
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In the case U = R° the influence function v(z, y, z) must be invariant with 
respect to a translation of the variable zx; therefore, it depends only on the 


difference x — y, and we assume without loss of generality that y = 0. 
The following two influence functions are considered (U = V = 9 = R°): 


v1(z, 0, z) = c—,(s x e), (1.1.33) 


——— (3(s,e)s — e), (1.1.34) 


where s = x/|z|,e = z/|z|, (s x e) is the vector product, and c is the magnetic 
constant. 

Physically, the characteristic 0 = z signifies the magnetic moment arising 
at y = 0 under the action of an elementary magnet located at the point z. 
The magnetic moment 0 = z created by the elementary magnet at z can 
take various values, depending on the individual properties of the magnet. 
With this characteristic regarded as random it is natural to assume that the 
distribution Pg(dz) is centrally symmetric, and this will be done below. 

The first influence function describes a picture arising in the theory of a 
constant (more precisely, steady-state) magnetic field created by charges mov- 
ing in the immediate vicinity of their average positions. This case corresponds 
to the so-called Biot-Savart relation. 

The second influence function corresponds to elementary magnets of dipole 
type. While in the first case we are dealing with a model for a magnetic field 
in the microcosm, on the level of elementary particles, in the second case the 
model relates to the macrocosm—the world of planets and stars scattered in 
space. If a star or planet has a magnetic field, then on the scale of the universe 
it can be regarded as a point elementary magnet of dipole type. 

The influence function v;(z, 0, z), as well as v2(z,0, z), has central symme- 
try with respect to z, i.e., v(—z,0, z) = —vı(x,0, z). They also have the same 
symmetry with respect to z, i.e., vi(z, 0, -—z) = —v;(z, 0, z). 

These properties, together with the assumption, natural for the problems 
under consideration, that the distribution Pg(dz) of the magnetic moment 
is centrally symmetric, enable us to consider a centered total magnetic field 
strength of the type Dı = (0) with characteristic function (1.1.14). With the 
assumption made about Ps(dz), the characteristic function of the variable 7 
can be given the form (recall that U = O = RÌ): 


filt) = explo | (eost) — 1) dz Po(dz). 
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The form of the influence functions allows us at once to claim that D, has 
a three-dimensional stable distribution with parameter a = 3, while > has a 
three-dimensional stable distribution with parameter a = 1. 

More detailed computations show that 


log fi (t) = —4V2mpC?/? t|”? A(t), 


where 
i 3/2 
A,(t) aj J ($c x o) |z|3/2 ds Pe(dz), 
o Js |\ ltl 
and g 
log fo(t) = -pC |t| A2(t), 
where 


- |z| ds Pa(dz). 


o= f flier) 


If the distribution Pg(dz) is assumed to be not only centrally symmetric 
but also spherically symmetric, then the functions A;(t) are constants, 1.e., 
the distributions of the are spherically symmetric stable laws; 7; has the 
Holtsmark distribution (already encountered in Example 1) (1:3), and D2 has 
a three-dimensional Cauchy distribution. 


§1.2. Stable laws in problems in radio engineering 
and electronics 


We shall become acquainted below with two examples of the occurrence of 
stable laws in models used in practice. These models are completely different 
and are combined here for purely formal reasons. 

The first example is connected with computing the performance of systems 
of radio relay stations (in engineering practice they are called radio relay 
communications lines). In a mathematical setting the part of the general 
problem to which we give our attention takes its origin from the paper [135], 
and its solution with the use of stable laws makes up the content of several 
papers, of which [136] must be regarded as the main one. The transmis- 
sion of high-quality radio communications over great distances (for example, 
television transmissions) poses for engineers not only the problem of relaying 
high-frequency radio signals that can be received only line of sight, but also 
the problem of combating noise distortions. The following seems to be one of 
the simplest models in which it is possible to trace both the effects themselves 
that arise here and the ways of analyzing them quantitatively. 

Consider a vector a € R? rotating with a large angular velocity w. The 
projection of a onto the z-axis at time t, under the condition that its mo- 
tion began from the position defined by the angle p, is the periodic function 
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y(t) = |a|cos(wt + p). The oscillatory excitation at the output of the radio 
transmitter is described by the function y(t), in which the quantity |a| is the 
amplitude of the radio signal (its power is proportional to |a|?), w is its fre- 
quency, and wt + ¢ is its phase (at the time t). The quantities |a| and w stay 
constant at the transmitter output, while transmission of the useful signal is 
accomplished by modulating the phase of the signal, i.e., by changing y. 

If the radio signal is received by the receiver without change, then its infor- 
mation content—the phase shift ~—could be recovered without difficulties. 
However, in real circumstances the radio signal comes to the receiver in a 
distorted and weakened form. This is due to the fact that multiple beam 
scattering of the signal takes place after output from the transmitter (even if 
one tries to give the signal a pencil beam direction). As a result, part of the 
radiation does not hit the receiver antenna because of the restricted size of 
the latter, and this leads to a drop in the power of the signal, which is natu- 
rally greater, the farther the receiver is from the transmitter. The part of the 
radiation reaching the antenna travels the distance not in a single trajectory, 
but in a pencil of trajectories. The different trajectories, in turn, cause the 
components of the pencils to reach the receiver with modified phase. Since 
the changes are small in themselves, it follows that in combination with the 
large frequency w and the fact that there are many such changes we have a 
distribution that is close to uniform with respect to the phases of the vectors 
associated with them. As a result, the total two-dimensional vector X has the 
nature of a random vector with a circular normal distribution. This implies 
that the length |X| of X has a Rayleigh distribution with density 


p(z) = D~*xexp(—2?/2D7), xr > 0, (12.4) 


where D? is the variance of the components of X. The weakening of the 
power of the signal and the transformation of the determinate (with respect 
to the value of the amplitude) radio signal into a random quantity does not yet 
cause serious difficulties for distinguishing the useful signal—the phase shift 
p. The complications begin after the incursion of the noise effects created by 
the receiver apparatus itself. 

The influence of the noise can be represented by adding to X a two- 
dimensional random vector Y having circular normal distribution with vari- 
ance of the components equal to o?. The vector X + Y has phase differing 
from that of X by an angle w determined by 

[Yt 
IX + Y,| 
where Y, and Y; are the radial and tangential components of Y with respect 
to X, and €(Y;) is a random variable determined by the direction of Y+ and 


tany = EC Y«), =n LYT, (1.2.2) 
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taking the values +1 and —1 with probabilities 5: Since the whole picture 


evolves in time, the vectors X and Y and the angle ù determined by them 
depend on t, i.e., are random processes. The processes X(t) and Y(t) (and, 
consequently, w(t)), which are connected with different sections of the relay, 
can be regarded as independent and stationary. 

The total distortion ~ of the information phase p at time t after passage 
of N sections of the relay is determined (if the delay in passing from section 
to section is ignored) by the equality 


tan w(t) = tan(y,(t) +--+ Yn(t)), —a<pe<n. (1.2.8) 


The distribution of 7);(t) is symmetric for each t, as is clear from (1.2.3). 
Therefore, Ey; (t) = 0. The quantity w(t) obviously has the same property. 
With this property taken into account, the value of the standard deviation of 
Y(t) from zero on some interval of time with fixed length is usually taken as 
a measure characterizing the level of noise in the transmission. For example, 
on an interval of 1 sec, 


1 
w= fae 
0 


The estimation of Y? is a fairly complicated problem in the analytic setting. It 
can be simplified by a certain coarsening. Namely, first, according to (1.2.3), 


1 
p< [ (x(t) +--+ b(t)? dt 


pne i 2 i l i 
"2s, voae+ df W(t), (t) dt. (1.2.4) 


Second, considering that the stationary processes 7); (t) are independent and 
that Ew,(t) = 0, which implies that 


1 
J A ALA ET 


we can ignore the second term in (1.2.4), taking the sum 


1 
S / Y? (t) dt (1.2.5) 


as an estimator of V2. The terms of this sum, in turn, admit a simplified 
approximate expression if the quantities €; = 0;/D,; are taken to be small, 
where ø? and DŽ? are the variances of the components of Y; and X; connected 
with the 7th part of the relay. Indeed, according to (1.2.2), 


1 1 
€;|U;| 
wr(t a= | arctan | ———__+~_-. | di, 
J i) 0 [Vt eU} 
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where U; = Ymj/o;,U); = Yp;/o;, and Vj = X;/D,;. Consequently, 


woas | amas f 21)" dt. 
J, a o |V; + e;U;l? TVR 

The next step in simplifying the estimator V2 has to do with the circum- 
stance that the vectors V;(t) and U,(t) vary with sharply different intensity. 
For example, V,(t) is practically constant on a time interval of the length 


being considered, while U,(t) performs an enormous number of rotations on 
the same interval (of the order of g For this reason, 


2 
favor wp WORA 


E|U,(1)|? = e7|V,|-*. 


e 
M 


a 


Consequently, a simplified estimator of Y? can be represented as a sum 
2 —2 
ii D ejl Vy] ’ 
F 


where the £; are constants small in magnitude, and the V; are independent 
random vectors having normal distribution with covariance matrix with iden- 
tity. Thus, as an estimator W? of the random variable Y? we get a sum of 
N independent random variables ¢7|V;|~* whose distribution functions have 
the form (as follows from (1.2.1)) 


FAs) = exp(—e7/2z), + > 1), 


Since the terms ¢4|V,|~? are small, while their number is sufficiently large, 
the distribution of ¥? can be sufficiently well approximated by an infinitely 
divisible law whose spectral function is not difficult to compute with the help 
of Theorem B in the Introduction. Namely, H(r) = 0 if z < 0, and 


H(z) x X (F;(z)— 1) <5 ode a] 


if x > 0. This spectral function corresponds, NEET to Theorem C.1, to a 
stable distribution with parameters a = 1 and 8 = 1 (see Remark I.4 about 
this). We present a more accurate approximation of the distribution of Y2, 
clothing this computation as a limit theorem. 

Consider a sequence of series of numbers €2,,...,€2,,n = 1,2,..., such 
that )), Cll = 1 and 6, = max; Eñ; — 0 as n — oo, and the sequence of 
centered sums 

Zn = Xni +09 4+ Anm — An 
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of MEAN random variables Xņ„; with distribution functions F,,;(r) = 
exp(—€ ol z),z > 0. The choice of centering constants A, will be made later. 

The Lapis Stieltjes transforms of the distributions F„;j can be expressed 
with the help of the Macdonald function K,(z) as follows (see [27], 3.471(9)): 


Eexp(—sX,;) = 2 se? Ky (2y/ se? ;), a>, 


Consequently, 


log Eexp(—sZ,) = sAn + ` log Eexp(—sXn;) 


J 
= sAn + J log {2 se? ;Kı( ENER }. (1.2.6) 
7 


If we take into account the fact that 
2r Kı(2£) = 1 + z? log z? + (2C — 1)x? + O(2* log x”) 


as s — 0 (see [27], 8.446; C is the ae constant), along with the requirements 
noted above about the quantities ¢?.., then the right-hand side of (1.2.6) can 
be transformed to the form 


nj? 


sA, + slogs + (x -1+ Soe, vee s + O(6, log(1/6,)). 
r 
Choose 
=- D loge?, + 1 — 2C. 
j 


Then (1.2.6) gives us that 
log Eexp(—sZ,) = slog s + O(6, log(1/6,)). 


The function exp(s log s) is the Laplace-Stieltjes transform of the distribution 
G(zx,1,1) (see, for example, Theorem 2.6.1); hence, 


P(Z, < xz) > G(z,1,1), n —> o. 


From this, setting By = JS e* in the original problem and letting cẸ; = 
e*/BN, we can assert that if dy is small, then 


P(W? < 2) = P(W? < 2) = G(xBy! — An, 1,1). 


The second example has to do with computing a model for a homogeneous 
electrical line, which can be, for example, an electrical cable or the circuit of 
cascaded four-pole networks (14). A number of properties of such a line can 
be described with the help of a so-called time function F(t, A),t > 0, which 
shows the reaction of a line of length A > 0 to a perturbation of “unit shock” 
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type at the initial time. The function F/(t, À) is called the pulse reaction of the 
line, and its Fourier transform f(w, A) is called the frequency characteristic of 
a homogeneous line of length ..* 

It is known from electrical circuit theory that when a homogeneous line 
of length A = A; + AQ is partitioned into sequentially connected sections of 
lengths A; and Ag, its time function F(t, A) is formed from the time functions 
F(t, à) and F(t, Az) of the separate parts via their convolution, i.e., 


F(t, Aq + X2) = F(t, A1) * F(t, A2), 


which is equivalent to multiplication of the corresponding frequency charac- 
teristics: 


flw, ài + Az) = f(w, Ar) fw, Az). 
It follows from this that for any A > 0 
flw, A) = fr(w, 1). (1.2.7) 
Connected with the function f(w, A) are the quantities 
a(w, A) = —Re log f(w, A) = Aa(w, 1), 
b(w, A) = Imlog f (w, A) = Ab(w, 1), 


called, respectively, the damping and phase of the frequency characteristic. 

It turns out that cases when the time function of the line is nondecreasing 
on the time axis t > 0 are not rare. If, in addition, the damping at zero 
frequency (w = 0) is equal to zero, then the time function F(t,A) can be 
regarded as a distribution function concentrated on the half-line t > 0. In 
this case (1.2.7) implies that F(t, À) is an infinitely divisible distribution with 
characteristic function f(w, A) of the form 

oo 
log f(w,A) =A (is +f (ert = 1) dH(u)) , y > 0. 
0 

For frequency characteristics of this form the phase b(w, A) can be recovered 
from the damping a(w, A) to within the term wyà. 

Thus, if a(w, A) = Ac|w|*, where 0 < a < 2 and c is a positive constant, 
then the corresponding damping of the phase has the form 


b(w, A) = Ac(wy + tan Fa|w|*sgnw), 17> 0, 


*Comparing the interest represented by this example with the scope of the exposition, 
we do not go into details here about the physical content of the functions F(t, À) and 
f(w,A). Those desiring to acquaint themselves more thoroughly with these concepts can 
turn to [132], where we borrowed the example (the notation was changed) and where there 
is a list of literature relevant to this problem. 
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while the time function F(t, à) connected with such a frequency characteristic 
is a stable distribution, 1.e., 


F(t, A) = Ga(t,a, 1,7, cA). (1.2.8) 


Certain forms of cables which have a power character of damping are known 
in electrical circuit theory. For example, a(w,A) = Ac\w|!/? for the so-called 
noninductive and coaxial cables. Consequently, according to (1.2.8), the time 
function of such cables has the form 


F(t, A) = Ga(t, 3,1, 7, cd) = 2[1 — ®(cA(t — eyA)~*/*)), 


where ® is the distribution function of the standard normal law. The pulse 
reaction F¥(t, A) in this case has the simple explicit expression 


P cA E a E 
A kT a a. exp] - 7 (x — cyA) Ja 


81.3. Stable laws in economics and biology 


In the 1960s stable laws began to attract the attention of scholars working 
in the area of economics, biology, sociology, and mathematical linguistics, due 
to a series of publications by the American mathematician Mandelbrot and 
his successors (see [118]-[128]). The fact is that statistical principles described 
by so-called Zipf-Pareto distributions had been empirically discovered fairly 
long ago in all these areas of knowledge. Discrete distributions of this type 
have the form 

pk = ck 172, k> 1, oO, 


while their continuous analogues (densities) have the form 
ala) = ego?" z>a>Q0. 


Mandelbrot called attention to the fact that the use of the extremal stable 
distributions (corresponding to the value 3 = 1) for describing empirical prin- 
ciples was preferable to the use of Zipf-Pareto distributions for a number of 
reasons. It can be seen from many publications, both theoretical and applied, 
that Mandelbrot’s ideas have more and more won the recognition of experts. 
In this way the hope arose of confirming empirically established principles in 
the framework of mathematical models and at the same time clearing up the 
mechanism for the formation of these principles. 

Below we present an example of a stochastic model that can be used to 
explain a so-called law. 

In 1922 the English biologist Willis published a book [254] dealing with the 
study of statistical principles in evolution processes. One of the main results 
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in his investigations was the discovery of the following principle observed in 
nature. 

Biological species are commonly taken as the primary elements in the clas- 
sification of living organisms. The species are then combined into coarser 
groups called genera. 

We consider a sequence of genera in the animal or plant world, ordering 
them according to the number of species occurring in them. Then we calculate 
how many genera in the total number contain a single species, two species, 
etc.; let these numbers be M1, Mo,..., etc., and let M = Mı + Mo +--: be 
the total number of genera involved. Next, form the sequence of frequencies 
Pe = Mk/M (k = 1,2,...), which allows us to represent the probability of 
finding exactly k species in a randomly chosen genus. 

In the language of probability theory, the discovery of Willis consisted in 
seeing that for genera containing sufficiently many species, i.e., for n > no 

Š pe œ An", (1.3.1) 
k>n 
where A is a constant. 

In other words, the probability of finding in a genus at least n species 
decreases with increasing n at the rate 1/yn. 

In 1924 the English mathematician Yule [253] was able to find a theoretical 
basis for the relation (1.3.1) in the framework of a stochastic model that can 
be included in the theory of branching processes. 

We demonstrate below (in a way different from that in [253]) how the 
Willis-Yule law could be explained. Most interesting here is the occurrence in 
our model of a stable law and its connection with the principle (1.3.1), which 
is traditionally associated with the Zipf-Pareto distribution. 

A model of a random branching process with two types of particles will be 
taken as a basis for the arguments. 

Let us consider the reproduction process as time passes for particles of two 
types. This goes as follows. At the beginning of the process we have a single 
particle of type Tı. During a unit of time this particle turns into a collection 
_ of 410 particles of type Tp and u11 particles of type Tı. The particles of type 
_ Tp remain unchanged, but the particles of type T; can be further transformed. 
The numbers p10 and u11 are random variables. We impose some conditions 
on this random process of reproduction of particles. 

1. The transformation of each of the particles of type Tı takes place in- 
dependently of its history and independently of what happens with the other 
particles. 

2. The joint probability distribution for the values of the random variables 
H10 and u11 at the time of transformation of a particle of type Tı stays the 
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same for all particles of this type and does not depend on the time at which 
the transformation takes place. Furthermore, the total number p10 + H11 
of “descendants” produced in the course of a single transformation cannot 
exceed some constant number h. 

3. The mean number 6 = Ep10 of particles of type To produced in a single 
act of transformation is positive, while the mean number of particles of type 
T; is equal to 1. Moreover, c = Var (411 > 0. The last condition, in particular, 
does not permit a particle of type Tı to produce with probability 1 only one 
particle of type T, at the time of transformation. Together with the condition 
Ei, = 1, this means that with some nonzero probability a particle of type 
T, does not produce any particles of the same type. Since particles of type 
Ty are not transformed, further transformation of the given branch is thereby 
terminated. 

It is known that transformations of a single particle of type Tı subject to 
the above conditions ultimately (with probability 1) cease. Its transformation 
process gives a certain random number U of particles of type To, called final 
particles. 

Let us assume that there are initially n particles of type Tı. In the process 
of their transformations and the transformations of their “descendants” (this 
process, as mentioned, stops with probability 1) there appear U;,...,U» final 
particles produced by the first, second, etc., initial particle of type T. 

According to conditions 1 and 2, the random variables U; are independent 
and identically distributed. Let us form a normalized sum of these random 
variables: 


Vn = (U1 +++: +U,)(26n?/c)~?. 


It can be proved that the distributions of the random variables V, converge 
as n — œ to the stable distribution with parameters a = 1/2, 8 = 1, y = 0, 
and A = 1 (to the Lévy law). 

In the recognized opinion of experts, the diversity of biological species 
came about as the result of evolution of living beings in the course of strict 
natural selection. The Earth’s climate has changed, both globally and in 
regional parts, and this has created new requirements on plants and animals— 
to survive they have had to acquire new qualities. 

This has been accomplished due to the variability of characteristics in new 
generations and due to natural selection of the fittest of their representatives. 
If we attempt to formalize this selection, then in simplified form the picture 
will remind us of the random branching process described above. By a frac- 
tional part we understand some portion of the population united by some 
characteristic important from the viewpoint of survival, and we understand 
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transformations of parts to be the changes arising in a long series of genera- 
tions; the fractional parts of type Tọ (the final parts) should be interpreted 
as the “descendants” that were able to secure qualities needed for stable ex- 
istence, while the parts of type Tı should be interpreted as “descendants” 
lacking such qualities and thus doomed either to extinction or to relatively 
rapid change. 

Of course, the model with final parts is an idealization of the phenomenon. 
Even well-adapted species are subjected to subsequent changes. But this hap- 
pens much more rarely and slowly than with the groups which “feel” discom- 
fort in their condition and are pressed toward further variability. Therefore, 
a model in which fractional parts of both types To and T; are transformed is 
in closer correspondence with the real situation: at the end of its existence 
a To-part turns into “oo To-parts and fo; T\-parts, and a T\-part turns into 
u10 To-parts and u11 Tj-parts. It is assumed that 


Euoo=1, Envir =1, O0<c1 < c= Vat pir < co, Siig <h, 


as in the original model, and that the numbers 6 = Eyjo,e = Euo1, and 
o = Var “oo are positive but small in absolute value. 

In this variant of the process the transformations of the fractional parts do 
not cease with probability 1, as was the case in the original process. Changes 
take place over an arbitrarily long period of time with positive probability. 
This property of the process corresponds more closely to real evolutionary 
processes. At the same time, if the value of £ is small, then the average lifetime 
of parts of type To is correspondingly large. The total number of parts in this 
model increases on the average. However, on any bounded interval of time 
this growth depends on the value of the sum 7 = £ + 6 and can be arbitrarily 
slow if this sum is sufficiently small. 

All the foregoing agrees well with our ideas about the flow of evolutionary 
processes. 

Before passing to an explanation of the Willis- Yule law we dwell further on 
a certain feature of the new variant of the model. 

In the new variant there are no final fractional parts. Therefore, one can 
speak only of the number V,,(A) = U;(A) + --- + Un(A) of long-lived parts 
of type To at a time A sufficiently long after the initial time. 

The principal way in which we see the second model as differing from the 
first is that To-parts are not final but simply long-lived in it. The quantity 
£ = Ey) will be regarded as the main parameter in the second model, because 
the first model is obtained by fixing the values h, 6, and c and letting £ — 0. 
In other words, in a certain sense the second model exhibits continuity with 
respect to variation of €. 
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Therefore, it seems quite likely (and is actually corroborated by compu- 
tations) that the distribution of the sum V,,(A) depends continuously on € 
as € — 0. This can serve as a basis for nevertheless replacing the second 
model, which better reflects the evolutionary process actually in progress, by 
the first model in computations, since the latter is more convenient on the 
analytic level. The analysis of the behavior of the distribution of the sum 
V,,(A) in the first model can, in turn, be replaced by an analysis of the be- 
havior of this sum corresponding to A = oo. The last condition means that 
we are considering the distribution of the total number of final fractional parts 
generated by the original n parts of type Tı. 

The validity of this replacement in analyzing the asymptotic behavior of 
the distribution can be justified. Of course, the error is all the smaller, the 
greater A is. Thus, for our purposes we can use the limit theorem given for 
the first model with respect to the distributions of the variables V,,. In refined 
form it asserts that as n — co 

P(V, > z) =P(Y < 2z)(1+ o(1)), 
where Y = Y (4,1,0, 1) and o(1) is a quantity tending to zero uniformly with 
respect to z. 

When considering a set of species (parts of type To) within the scope of a 
single genus it is natural to assume that they all have a common root (i.e., 
are generated by a single part of type Tı). Between species and genus, of 
course, there were also intermediate forms not constituting independent units 
of classification. We think of them as n — 1 initial parts of type T; that in the 
final analysis produce U2+---+U, parts of type To. The initial part of type Tı 
also produces U, parts of type To. Together they produce W = U1 +---+Un 
parts of type To. The distribution of this sum is precisely the distribution of 
the number of species in a genus. 

Since 6 is small while the quantity c is bounded above and bounded away 
from zero, the ratio B? = 26n?/c can be regarded as a quantity bounded both 
above and below (by suitably choosing n). Then the number of species in a 
genus has a distribution that can be approximately expressed by the stable 
Lévy law: 

P(W > zB?) = P(V, > 2) S P(Y > 2). 
If we consider that for large x 
1 E l i% 
G(z, 5,1) = P(Y IP) A a i, 
then we obtain the Willis-Yule principle (1.3.1): 


P(W > z) x P(Y > 2B?) x ago 
T 


CHAPTER 2 


Analytic Properties of Distributions 
in the Family G 


In contrast to the next chapter, where specific properties of strictly stable 
laws are considered, the material in the present chapter relates to all the stable 
laws. The analytic basis for the presentation here is the explicit expression for 
the characteristic functions of the distributions in the family G in one of the 
two forms (A) or (B). The specific form of parametrization with which the 
formulation of a particular property is associated is either stated explicitly or 
indicated by a corresponding subscript. The absence of such a specification 
means that the formulation is associated with the form (B). 


The stable laws (and the random variables having such laws) with param- 
eter values y = 0 and à = 1 in the form (B) will be called standard. The set 
of standard stable laws is denoted by Go. 


The concept of a standard law is connected with a definite form (we chose 
the form (B) as most suitable for this purpose from an analytic point of view) 
because the sets of standard laws with values y = 0 and A = 1 do not coincide 
in the various forms. It is not difficult to establish the values of the parameters 
of standard laws in the forms (A) and (M) by using the formulas for passing 
from the form (B) to the forms (A) and (M). 


: In each of the forms of parametrization for the family G or its subfamilies 
-= %W and Go there is an indicated domain of variation of the parameters corre- 
sponding to this form, which we shall call the domain of admissible values of 
= the parameters. 


82.1. Elementary properties of stable laws 


The very simple expression for the characteristic functions of stable laws 
enables us at once to note a whole series of interrelations between them. With 
these relations as a basis, it becomes possible, in particular, for us to reduce 
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the study of the analytic properties of the distributions in G to the study of 
the properties of distributions in various subfamilies of ©. 

It was already mentioned in the Introduction that in many cases it is con- 
venient and intuitive to assign to relations between stable distributions the 
same form of relations between random variables having these distributions.* 
This form is employed for formulating the properties given below (with the 
exception of the last property). The reverse passage from relations between 
random variables to relations between the corresponding distributions is suf- 
ficiently simple and does not require additional explanations. 


PROPERTY 2.1. Any two admissible parameter quadruples (a, 3,7, A) and 
(a, B, y’, Aà’) uniquely determine real numbers a > 0 and b such that 


Y (a, B, y, à) = aY (a, B, ~, A") + àb. (2.1.1) 


In the form (A) the dependence of a and b on the parameters is expressed as 
follows: 


a = (à/X')"*, 


p= q = Y (A/A') 2? ifa#l, 
o y— Y + ŽßBlog(à/à') uv a1. 


We single out an important particular case in (2.1.1). Let y = 0 and 
A = 1. Then 


Y (a, B, y, A) £ A1/*Y (a, B) + A(q + bo), (2.1.2) 


where bo = 0 if a £1, and bọ = 2 Blog A Hasi, 

The equality (2.1.2) shows that À signifies a scale parameter, while + sig- 
nifies a translation parameter (more precisely, a pure shift of the distribution 
is a linear function of +). 


PROPERTY 2.2. For any admissible parameter quadruple (a, 3,7, A) 
Y (a, —B,-7,) = -Y (a, 8,7, A). (2.1.3) 


The useful content of this property consists, in particular, in the fact that 
it allows us without loss of generality to consider the distribution functions 
G(z, a, 3,7,) with the one (according to our choice) additional condition 
that the sign of the argument z, the parameter 8, or the parameter 7¥ is 
preserved. 


*In this section the absence of an index means that the relation is valid for both forms, 


(B) and (A). 
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PROPERTY 2.3. Any admissible parameter quadruples (a, Bk, Yk, Ak) and 


any real numbers h and cy, k = 1,...,n, uniquely determine a parameter 
quadruple (a, _ 7 such that 
a, B, y, À aces , Bk, Yk, Ak) + h. (2.1.4) 


In the form (A) the dependence x the quadruple (aœ, B, Y, à) on the chosen 
parameters and numbers 1s as follows: 


A= hi Ak |cx|®, 
k 
AB = Ý Ak Belce|* sgn ce, 
k 
Ay = X AKC + ho, 
k 


where ho = h if a Æ 1, and ho = h — 2y Ak Breck log |cx| if a = 1. 
We mention some special cases of Property 2.3 that are of independent 


interest. 


PROPERTY 2.3.a. An arbitrary admissible parameter quadruple (a, B, Y, A) 
and any 3’ and 6" with -1 < B’ < B < B" < 1 determine unique positive 
numbers c' and c” and a real nea l such that 


Y (a, B, y, A) = PY (a, B') + cY (a, 6") +1. (2.15) 


In the form (A) the dependence ; the parameters and the numbers is expressed 


as follows: 
J- (x 8" — el Ne (x B-B' oa) 
pv — B' ? B" — Bl B' ; 


= Ay x = I, 

àq + +4 2 (p'e loge + 0" loge”) Yasi, 
Choosing 3’ = —1 and 8” = 1 and using the equality Y (a, —1) = —Y (a, 1), 
= we get that any random variable Y(a, ß8,Ņy,à) can be expressed as a linear 


combination of two independent random variables of the form Y (a, 1) (in the 
sense of the equality wn). 


PROPERTY 2.3.b. For any admissible il quadruple (a, B, Y, A) 


Y (a, B,y, A) — Y (a,b, 7, ) = Y (a, 0,0, 22). (2.1.6) 


PROPERTY 2.3.c. Any admissible parameter quadruple q = (a, b, Y, A) 
uniquely determines an admissible quadruple q* = (a, 3*,y*,A*) such that 


Y (a, B,,d) — $Y (a, 8,7, 4) — 3Y (a, B, y, A) = Y (a, 8", 7",A"). (2.1.7) 
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In the form (A) the parameters of q* are expressed in terms of the parameters 
of q as follows: 


j= 2-9 i=) i 
fp a a a Te 
j ipaa’ GE 1421-e ~ 3)’ 


y*=Oifa#l1 and y* = —(Blog2)/z ffa=1, 
* = (1 +219). 


It is not hard to see that the random variables on the right-hand sides 
of (2.1.6) and (2.1.7) have strictly stable distributions. This feature of the 
transformations of the independent random variables (having an arbitrary 
stable distribution) on the left-hand sides of (2.1.6) and (2.1.7) turns out to 
be very useful in the problem of statistical estimation of parameters of stable 
laws (Chapter 4). 

The above properties are all proved according to a single scheme. We 
consider the characteristic functions of the left and right sides of (2.1.1)—(2.1.7) 
(for example, in the form (A)) and, writing out their logarithms, compare the 
coefficients of the linearly independent functions of the variable t. As an 
example let us prove (2.1.4) in the case when a # 1. We have that 


À (ity — |t|% + 2t|t|/*~" 8 tan Za) 
= ith + X. Ak (itykck — |cx|*|t|* + it|t!°~ ‘cx |ce|°~ 1 Bx tan Za). 
k 


Comparison of the coefficients of the functions zt, |t|, and 7t|t/*~! gives 
relations determining the values of the parameters 8, y, and à. It is obvious 
that y and à have admissible values. If we write out the parameter ( in the 


final form 
>p Ak|Ck|®% 6k Sgn Ck 
B DE SE etal lel e 
ok XK |Ck|® 
then the satisfaction of the condition |8| < 1 also becomes clear, because 
[Px | < 1. 


PROPERTY 2.4. For any admissible quadruple q = (a, B, Y, A) of param- 
eter values and any admissible quadruple q’ = (a’, 3’, Y, A) such that qd —> q 
(1.e., the corresponding parameters converge) 


Yu(a’, ip y’, x) s Yula, f, 7; A). 


The convergence Æ is understood as weak convergence of distributions. 
However, taking into consideration that stable distributions have densities, we 
can at once conclude that the convergence £ can be replaced by convergence 
in the sense of nearness of the corresponding distribution functions in the 
uniform metric. 
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PROPERTY 2.5. Let (a,8,7,A) be an admissible parameter quadruple, 
and let med Y be the median of the random variable Y = Y (a, B, Y, à). Then 
the following inequality holds for any 0 < s < a: 

E|Y — med Y |° < 4(2A)*/°T (1 — s/a)I'(s) sin Zs. (2.1.8) 

REMARK. We need not consider the case a = 2 corresponding to a normal 
law, since (as elementary computations show) we have here that for all s > 0 

E|Y — med Y |° = 421-9/2(2A)°PT(1 — 8/2) (s) sin Zs 
< 4(2A)°PT(1 — /2)P(s) sin Zs. 
An obvious consequence of (2.1.8) is that E|Y |° < oo for any 0 < s <a. 
The proof of (2.1.8) is based on the following expression of the absolute 


moment E|X|* of an arbitrary random variable X with the help of the char- 
acteristic function f(t) of X: 


E|Xx|° = 2F(1+s)sin zs f (1 — Re §(t))t°-} dt. (2.1.9) 
0 
It is obtained by substituting X for x in the equality 
\z|° = 2T (1 + s)sin zs f (1 —costx)t~*—! dt (2.1.10) 
0 


and taking the expectations of both sides. The equality itself is established 
by the same device as used to compute the same integrals in Theorem C.2. 
Namely, for real p > 0 we first compute the integral 


J (1 -e ®t tdt = p'T(1-— 8)s7t; (2.1.11) 
0 


then we extend it analytically to the half-plane Re p > 0 and use the continuity 
of this expression on the imaginary axis. After substitution of the value 
p = —iz the real part of (2.1.11) coincides with the integral in (2.1.10) of 
interest to us. 

Let us consider independent random variables Y’ and Y” with the same 
distribution as Y. According to (2.1.6), the random variable Y’ — Y” has 
characteristic function exp(—2A|t|*). From this, on the basis of (2.1.9), we 
get 


Ely’ — Y"|° = 2 (1 + s)sin zs f (1 — ep -2M e e dt 
0 


= 2(2d)*/P(1 — s/a)P(s) sin 58 (2.1.12) 
The right-hand side of (2.1.12), hence also the left, is finite for 0 < s < a. 
Further, the easily verified inequality 


P(|Y’-Y"| >) > ¿P([Y' — med Y"| > z) 
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gives us the inequality 
E|Y — med Y |° < 2E|Y’ — Y"°, 


which together with (2.1.12) leads us to (2.1.8). 

We conclude this section with an observation which has to do both with 
the relations given above among the random variables Y (œ, 8,Ņ,à) and the 
relations between the random variables Y(a,@) and Z(a,p) to be introduced 
(+-1)* in Chapter 3. 

The following theorem is given in [58] after one of the results in the note 
[17] is generalized. 


Let Yı z Y> = Y(a,0,0,) be independent random variables and let L = 
(Lı, L2) be an arbitrary two-dimensional vector with P(Z, + L2 = 0) = 0 that 
is independent of Yı and Y2. Then 


a Yı = | ———_ Yo = Vy. 
(ee) i Li+ Ll2 7. : 


It is not hard to see that this relation extends one variant of (2.1.4) to the 
case when the coefficients cą are assumed to be random variables. Arising 
naturally in this connection is the question of how general the situation is 
when various parameters in relations between random variables can also be 
regarded as random if desired, without thereby disturbing the validity of the 
relations. It turns out that such a transition from nonrandom to random 
parameters is possible under relatively slight restrictions, and it is based on 
the following general fact. 


LEMMA 2.1.1. Let h = h(a) be a function defined and measurable on 
a set A C R™ and taking values in R”, and let X(a) and X'(h) be random 
variables whose distributions depend on respective parameter tuples a and h 
and which are connected for eacha € A by the relation 


X(a) = X"(h, (a)). 


Then, for any random variables V £ V' with values in A and such that V is 
independent of X while V’ is independent of X', 


X(V) É X"(h(V’)). 


*Raised numbers in this form refer to the corresponding notes in the Comments at the 
end of the book. 
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The proof is elementary. We have that 
Eexp(itX(V)) = J Eexp(itx(a)) Pr (da) 


= | Eexp(itX"(h(a))) Pr» (da) 
= Eexp(itX'(h(V’))). 


This obviously implies the lemma. 
Let us illustrate the foregoing by an example. Let 


V =(a,B,A,¢1,---,€n) EV’ = (œ, BN, c),...,€,) 


be random variables such that, with probability 1, a # 1, 6, and à vary in the 
domain of admissible values, 5°, |ck|* = 1, V is independent of Y1,..., Yn, 
and V’ is independent of Y. Then, according to (2.1.4) and Lemma 2.1.1, 


¥(a’, 6,0, 2’) = Y cx Yu (a, 8,0, 2). 
k=1 


82.2. Representation of stable laws by integrals 


Theorem C.2 established an explicit form for the characteristic functions g 
of stable laws. In this section we turn to the question of an expression for the 
stable distributions themselves, on the basis of the expression for the functions 
g in the form (B). Due to Property 2.1, it is possible without loss of generality 
to consider only standard stable distributions. What is more, in considering 
the density g(z, a, 3) or the distribution function G(z, a, 3) it suffices to solve 
the problem of representing these functions in the more restrictive situation 
when z > 0 or 8 > 0, since, according to Property 2.3, 


g(—2, a, f) = mE: a P 
G(—z, a, 8) = 1 — G(z,a,—8), 
g(—t, a, b) = g(t, a, —B). 


As already noted in the Introduction, the function |g(t)| is integrable on 
the whole real t-axis, and hence the density g can be expressed with the help 
of the function g by the inversion formula 


a(z,a,8) = z | olta ) a 


_1 Re | ett g(t, a, —B) dt. (2.2.1) 
0 
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It is natural to begin the search for diverse variants of representations for 
stable laws by analyzing the inversion formula (2.2.1), which gives a first 
variant for expression the densities g(x, a, 8) of standard distributions. 

Substituting in (2.2.1) the expression given in the Introduction for the 
function g(t, a, 3) in the form (B), we get that 


g(x, a, B) = ` Re f 


in the case a Æ 1, and 


als, 1, 8l = - Re | exp { —itz -= xt — ibt logt} dt (2.2.1b) 


Co 


exp { ite — t“ exp (—i58K(a)) \ dt (2.2.1a) 


in the case a = 1. 

Although, as we see, the functions g can be written in a simple form,* there 
are expressions in elementary functions for the densities corresponding to them 
in only four cases. That these stable laws actually have the densities given 
below can be verified by direct computation of the corresponding integrals in 
(2,2.1a) and (2.2.1b) (?-*). 

1. The Lévy distribution 


ja ee eae if z > 0, 
i 0 if x <0. 


2. The Cauchy distributiont 
mE 1:0) = t(n? /4 toyi 
3. The Gauss distribution (with variance o? = 2) 


g(x, 2,8) = e214 12, / x. 


4. The case obtained from the first by symmetric reflection: 
1 1 
g(z, 90 =) = gia, 2) I). 

For transformations of the integral in (2.2.1) with the purpose of obtaining 
other expressions for the density it is desirable to have an analytic extension 
of the function g(t, a, 3) into the complex z-plane with semiaxis Re z = t > 0. 
When considering analytic extensions of the functions g, the case a = 2 
corresponding to a normal law can be skipped as obvious. For it the function 
g(z, 2, 8) = exp(—2z?) is entire. 

*It should be noted that this section does not include all the integral representations of 
stable distributions which we decided to put in the book. For methodical reasons some of 
these representations are spread over other sections (see 2.3, 2.5 and 3.4). 


tThe expression for the density in the form (B) differs from the traditional form of 
expression, which corresponds to the form (C): gc(z, 1,0) = +(1 + g2)-1, 
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If a < 2, then the function g(t,a,) can also be extended analytically 
from the half-line t > 0 (or from t < 0), but it then has the branch points 
z = 0 and z = œ. Therefore, to single out a principal branch of it we must 
make a cut along some contour joining 0 and oo without intersecting the real 
t-axis. In what follows we need an analytic extension of g in the half-plane 
Rez > 0, and in this connection we make a cut in the lower half-plane along 
the ray {z:argz = —37/4}. The analytic extensions of g(t,a, 3) from the 
semi-axes t > 0 and t < 0, where it is given by different formulas, are denoted 
by g*(z) = g*(z,a,B) and g~(z) = g-(z,a, 8). It is not hard to verify by 
using the expression for g(t,a,) in the form (B) that the extensions have 


the forms 
— z*exp(-158K(a)) ifaFl, 


Hia 
log g @)={ — 2(5 +126 log z) tasi 
(where z% and logz are understood to be the principal branches of these 
functions) and 


(2.2.2) 


S — (—z)®“ exp(iZßK(a)) ifa #1, 
logg” iz) = r — ame) wasl: 


An elementary transformation of the functions (2.2.2) and (2.2.3) enables us 
to give them the desired form (e(a) = sgn(1 — a)): 


— e(a)(—iz)* exp(—i3K(a)( - 1)) ifa #1, 


(2.2.3) 


logg" (2) = (—iz)(i(@ - 1)3 + Blog(-iz)) feat, 4) 
— — €(a)(—tz)*exp(t5K(a)(8—-1)) ifa #1, 
ogg ye | (~iz\(i(1-8)E + Blog(—iz)) Rasi PMA 


Comparison of (2.2.4) and (2.2.5) shows that the analytic extensions gt 
and g7 do not coincide, except in the already mentioned simple case when 
œ = 2 and the case when a < 2 and @ = 1. If we consider the equality 
g(t, a, 3) = g(—t,a,—), then we can add the case when a < 2 and 8 = —1 
to the indicated case. The following assertion summarizes the foregoing. 


LEMMA 2.2.1. Analytic extension of g(t,a,() from the whole t-azis to 
the complex z-plane is possible only in the case a = 2, while analytic extension 
to the complex plane with a cut along the ray arg z = —37/4 is possible only 
in the cases a < 2, B = 1 anda < 2, B = —1, which correspond to extremal 
stable distributions. Moreover, 

: _ [ -ela)(-iz)? ifa#1, 
log 9" (2, a, 8) = (—iz)log(—iz) wfa=1. 228) 

REMARK. The function a(z) = log g*(z,a,1) has the following integral 

representation in the half-plane Im z > 0 (see the proof of Theorem C.2, where 
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similar formulas have already been encountered): 


af (7u —1—izu)u-(¢t) du ifa > 1, 
=a) Js 
Yale) = aa = J (em — Tyr) dy ifa <1, 
= 0 
J (ete — 1 — iz sin u)u™? du fo= 1. 
0 


We pass to the construction of the integral representations of interest to us 
for the density of g. The transformation of the integral 


0 


in (2.2.1) will be implemented by changing the contour of integration, in such 
a way that the value of the integral J, or at least the value of its real part, does 
not change. The next lemma is needed for justifying the changes of contour. 


LEMMA 2.2.2. LetO < e< 7/4 be a fized number. Then as r — œ or 
ro 0 


Q(C,) = | solei anan 


for any sequence of contours of the following form: 
1. Ifx > 0, a < 1, and B has any admissible value, or if x is any real 
number, a= 1, and B > 0, then 


G, = {ekl =r, 0 Sarge = 7 =e. 
2. [fx >0,a>1, and B has any admissible value, then 


T n Klaj r 
C, = { z: |z| = r, — < cee R 
fz |z| Mo, te Sarg 58 <<} 


PROOF. In all the cases considered the length of the contour C, is at most 
nr. Therefore, 


|Q(C,)| < mr max(|e*** gt (z, a, —B)|: z € C,) 
= trexp{max(U: z € C,)}, 


where U = —z Im z + Relog gt (z, a, —8). 
In the case when a < 1 and z > 0 we set z = re’? and get that 


U = —rrsin p — r“ cosa(y + 73/2), 
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whence 
max(U: z € C,) = max(U:0 < p < T — £) 
< max(U:0 < p < $(1—a)) 
+max(U: §(1—a) <p <r-e) 
< r“(1 — cos F$a(2 — a)) — zr min(e, $ (1 — a)). 
For large values of r the last expression can obviously be estimated from above 
by the quantity —2logr, and this implies that |Q(C,)| < t/r — 0 as r > o. 
In the case when a = 1 and 8 > 0 
U = —(Blogr + z)rsiny —r(F + Bp) cos g. 
From this we find, as in the preceding case, that 
max(U:z € C,) = max(U:0 < p< 7-€) 
< max(U:0 < p < 7/6) + max(U: 7/6 < » < 1 -¢) 
< 2(4 + |z|)r — Br log r min(e, 5). 
Consequently, for large values of r 
max(U:z € C,) < —2logr, 


and hence |Q(C,)| < t/r — 0 as r — oo. From these estimates it is also clear 
that |Q(C,)| = O(r) as r > 0. 
The case when a > 1 and x > 0 is handled in exactly the same way. 
REMARK. The assertion of the lemma is preserved if the contours C, are 
replaced by parts of them. 


LEMMA 2.2.3. In the complex z-plane with a cut along the ray arg z = 
—3r /4, consider a family {I } of contours satisfying the following conditions: 

1. Each contour T begins at the point z = 0. 

2. None of the contours T intersect the cut. 

3. By moving from the point z = 0 along a contour T one goes to infinity 
in such a way that from some point all the points z ET have values of their 
arguments in the intervals 


O<argz<7-eé, 
e€ — 7m(1 — BK(a))/2a < argz < r(1+ GK(a))/2a, 


where 0 < a < 2, |B| < 1, and £ > 0 ts an arbitrarily small number. 
Then, for any contour of the indicated form and any pair (a # 1,8) or 
(a= 1,8 > 0) of admissible parameters, 


j eraa, -b)dz = | e*tg* (2, a, =b) dz. (2.2.7) 
0 r 
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PROOF. For a chosen contour I and for each r > 0 let us also define the 
following auxiliary contours: 

T, is the part of I obtained by moving from the initial point z = 0 to the 
first point z, where I intersects the circle |z| = r. 

L, is the part (disjoint from the cut) of the circle |z| = r between the points 
z = r and z, traversed in the direction from z = r to zy. 

We form the closed contour D, r = [r, R] U Lr U (T R\T,)U L, (an overbar 
indicates that a contour is traversed in the opposite direction). 

Since the function w(z) = e’?*g*(z, a, —) is anlaytic in the domain where 
the contour D, r lies, Cauchy’s theorem gives us that 


R 
| wdz= | wdz+ f wdz- | wdz+ | wdz- | wdz= 0. 
DR r LR TR ay Ly 


Invoking Lemma 2.2.2 (see the remark after it), we find that 


1 wdz+ | wdz- f wdz—=0 aR-w,r-—0. 
LR T Di 


s 


Consequently, (2.2.7) is valid. 


THEOREM 2.2.1. Let x be an arbitrary real number, and let a and 8 be 
admissible parameter values for a standard stable distribution. 
1. Ifa < 1 and z > 0, then 


glz 0, B) = Smf exp { -zu — u“ exp (-iF a(1 +8))} du 


1 oO 
z im f exp{—zu — u“ exp(—impa)} du. (2.2.8) 
0 


2. Ifa=1 and B>0, then 
JE 


1 ii l 
g(2, 1,8) = im f exp { -zu — Bulogu + iz 


3. Ifa >l, then 


jlt wp) = eRe f exp f -u° — itu exp (i30) = i50) du 


(i+ Bju} du. (2.2.9) 


1 (© e] 
ai= im f exp{—u®“ — ruexp(irp) + imp} du. (2.2.10) 
0 


The proof of each of the formulas (2.2.8)—(2.2.10) reduces to the following 
actions: 

a) choice of a contour I needed for transforming the integral in (2.2.7): 

b) justification of the change of the contour of integration in the integral 
in (2.2.7) with the help of Lemma 2.2.3; and 
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c) replacement of the variable of integration on the right-hand side of 
eae 

The positive part of the imaginary axis is taken as the contour I in cases 
1 and 2. The use of Lemma 2.2.3 here does not require any additional expla- 
nations. For such a contour the change of variable z = iu, u > 0, leads to the 
corresponding expression for the density. 

In case 3 the ray [ = {z: arg z = 70/2} is chosen for the role of the contour 
I’, with the subsequent change of variable z = uexp(iz0/2). 

The transformation of the integrals are very simple in all the cases consid- 
ered. For example, if a < 1 and z > 0, then (2.2.4) gives us that 


a(2,0,8) = + Re f E ee, =e 


-lm f exp f-zu -u° exp (izal +8))} du 


1 ia M T 
— Im exp { -ru -u exp (-i5a(1 +8) } du. 
T 0 2 
COROLLARY 1. Ifa <1, then for any admissible B and any x > 0 
g~ d 
G(z,a,B)=1- - | e 7“ Im exp(—u%e7 70a) SE 
T Jo u 
Ifa = 1, then for any admissible 3 > 0 and any real x 
LF TT du 
G(z,1,8)=1-—- exp(—ru — Gulog u) sin [Za + B)u —. 
T Jo 2 u 
If œa > 1, then for any admissible 3 and any z > 0 
Giza, p) = Eil 6) JA e~“" Imex (—cueineay 
T, ’ a 9 = 0 p u 
In the case œ < 1 these equalities are obtained by integrating the corre- 
sponding expressions for the density. In the case œ > 1, besides integrating 


the density from 0 to z, we should use the expression for G(0, œ, 3) given by 
(2.2.30). 


COROLLARY 2. Ifa #1, then for any admissible values of 3 


g(0, a, 8) = =I (1+4) cos (Fan) (2.2.11) 


In the case a > 1 this equality is obtained by substituting the value z = 0 
in (2.2.10) and computing the resulting integral. 

In the case a < 1 we first consider the expression (2.2.8) for the density 
g(x, a, —|3|), z > 0, and pass to the limit as z — 0. The integral obtained in 
the limit is computed. Then we use the fact that 


g(0, œ, 8) = g(0,a, —8) = g(0, a, —|4)). 
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In the case when a = 1 and 8 > 0 there is no elementary expression for the 
value j ge 4 
. [7 u 
g(0,1, 8) = -| exp(—(u log u) sin Za + Byul — 
T Jo 2 u 
of the density at zero (%3). 
The following integral representations of stable laws are connected with the 
parametrization in the form (A). Let u = $tan(ma/2) and y = z + |2|'~%p. 


THEOREM 2.2.2. 1. Ifa < 1, then for any admissible 3 and x #0 such 
that y > 0 


ga(x t p|z|'~%, a, B) 


JORON fh du. (22.2) 


2. If a=1, then for any real x #0 and any B £0 
2 
ga (2+ 230g |, 1,3) 


Bru 
=f exp {- z] “Gls ae i(t +a) } du (2.1.13) 


PROOF. In the first case we have, according to (2.2.1a), 
ga(z 7 p e, Q, p) 


1 oo 
aa Re | exp(—izy — 2% + tz) dz 
0 


= — Re | exp {i= — |z|~*[u® — ipu(u%—* — D} du. (2.2.14) 
0 


Since œ < 1 and y > 0, it is possible to rotate the contour in the integrals 
(2.2.14) through the angle —7/2 without changing their real parts. The result 
iS (2.2.12). 

The second case is analyzed similarly. Taking (2.2.1b) as a basis, we get, 
after the corresponding transformations, 


2 
JA (: + ~ Blog |z|, 1.8) 


=i ed eek ese, 
r Re | exp { Tall P a (14i +i- -|8| log z) dz. (2.2.15) 


Rotation of the contour of integration in (2.2.15) through the angle —7/2 
leads us to (2.2.13) (%4). In both cases the rotation of the contour is justified 


§2.2. REPRESENTATION BY INTEGRALS 73 


by standard arguments already used above more than once, with the help of 
Lemma 2.2.2. ) 

It is interesting to note that (2.2.13) can be obtained from (2.2.12) by 
passing to the limit as a — 1. The proof of it is based on the following 
arguments. First of all, we can conclude on the basis of Property 2.2 that in 
the case 8 #0 

ga (y, a, B) = ga(y*,a,|G)), 
where the quantity 


y* = ysm p = pa + |8| tan T aļz|1-° 
|8] 2 

obviously becomes positive for values of œ sufficiently close to 1. Therefore, 
without loss of generality we can consider only the case 3 > 0 in (2.2.12). It 
is not hard to see that in this situation the integral in (2.2.12) is transformed 
into (2.2.13) as a — 1. It remains to establish that the left-hand sides of these 
equalities also approach each other. The connection between the parameters 
of the forms (A) and (M) and the specific joint continuity property of stable 
distributions in the form (M) with respect to all the parameters noted in the 
Introduction show that 


QA = QM, Ba = Bm, 

ga(x,a, b) = gm (zx — u, a, p), 

ga(z, 1,8) = gm(z, 1, p), 

gm(z,a,3) > gm(z,1,8) asa. 


(2.2.16) 


Moreover, it is not hard to verify that 
ulet —1) > 2Blog|z| asa — 1. (2.2.17) 
It follows from (2.2.16) and (2.2.17) that 


ga(x + plz|*~*, a, B) = gu (x + w(|z|1~* — 1), a, 8) 
—> 9M (x 2 Blog |z], 1, 8) 
= 9A (x T 2 Blog Ear 1, 8) . 


REMARK 1. The case a = 1, 3 = 0 included in the second part of the the- 
orem corresponds to the well-known Cauchy law, and thus it can be excluded 
without damaging the formulation. 

REMARK 2. The case a > 1 was not considered in Theorem 2.2.2. It 
seems that a direct analogue of (2.2.12) does not exist here. However, there is 
a formula that can serve as a complement to this theorem in a certain sense. 
For methodical reasons it is included in the next section (see (2.3.6)). 
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REMARK 3. There are no representations analogous to (2.2.12) and (2.2.13) 
for the distribution functions. More precisely, there are such representations, 
but they are not very simple, because the integrands no longer contain only el- 
ementary functions, but also functions such as the incomplete gamma-function 
and the integral exponential function, which must furthermore be considered 
in some part of the complex plane. 

All the integral representations of stable distributions given above have 
one feature which sometimes greatly complicates their use, for example, in 
constructing estimates in tabulating the values of the density and of dis- 
tribution functions, etc. This feature is that the functions under the integral 
sign in the corresponding representations oscillate, i.e., change sign infinitely 
many times. It is natural to ask whether there are representations of stable 
distributions free from this deficiency. The answer turns out to be positive. 

The following notation is used below (some of it was used earlier): 


ela) =sgn(l—a), 0=K(a)/a, 0* =Osgnz, 
C(a,0) =1-4(14+ 4)(1+ e(a)), 


U.(v.6) = (= zalp +8) at= a cos 3 ((a — 1)p + a) 
oh ida al cos Fp cos Fp 


1 
Uily,B)=5 cee aus exp G (6 + 5) tan zo) 


THEOREM 2.2.3. The densities of standard stable distributions can be 
written as follows: 
1. Ifa #1 and zx #0, then for any |8| < 1 


’ 


1/(a—1) 


1 
o(z,0,8) = ——— | Uele, expla? -DUalo 6") de. 
_@* 
(2.2.18) 
2. Ifa=1 and B £0, then for any x 
1 
gia, 1,8) = sae? | Vile B)exrt-e*"ti(p,B)} do (2.2.19) 


We mention that excluding the cases a 4 1, x = 0 anda = 1, 8 = 0 in 
the formulation of the theorem involves no loss of generality for its assertion, 
because the first of these cases was consdered in (2.2.11), while the second 
corresponds to the well-known Cauchy law (%5). 

The proof here, as in the preceding theorems on integral representations, 
has the inversion formula (2.2.1) as a starting point: 


g(—z,a,—-8) = Re f explizz + Y(z,a,—ß))dz. (2.2.20) 
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The function (z, a, 3) = log g+ (z, a, 8) is taken in the form (C), i.e., 
Vlz, 0,3) = - 2 exp(-i$0a) ifa #1, 


—- 32 = ifzlogz if a= 1. 
Without loss of generality, we assume that z > 0 in the case a #land B>0 
in the case a = 1. In the complex z-plane consider the contour 


I = {z:Im(tzz + p(z,a — B)) = 0, Zk < arg z < Th 


where k = —6 if a £ 1, and k = -1 if a = 1. For a < 1 andĝßĝ = —1 we 
have that k = 1. According to (2.2.6), Y(z,a,1) = —(—iz)®, and hence the 
sum tzr + w takes real values on the positive part J+ of the imaginary axis, 
i.e., T = I+, which explains the meaning of the condition m/2<argz < 1/2 
arising in this case. 

In the remaining cases the contour [ is a curve whose equation in polar 
coordinates (z = re’®, tk/2 < p < 1/2) is obtained by using the form (2.2.21) 
of the function w: 


(2.2.21) 


rrcosp —r*sina(y+70/2)=0 ifa#1, 


222 
rrcosy + Brlogrcosp — (t/2 + By)rsinp =0 ifa=1. l ) 
The solution r = r(y) is found in the explicit form 
sina(p + 79/2) \ O7% a4 
r(p) = T COs q (2.2.23) 


exp (—2/8+(o+7/26)tany), a=1. 
The contours described by the functions (2.2.23) differ in their form, depend- 
ing on the choice of the parameters (a, 3) to which they correspond. They 
can be divided into four groups. 


TEF 


FIGURE 1. Form of the contours I% of integration (in polar 
coordinates). 


L {a<1,8==1), withT =f; =f, 

2. (a < 1, 8 # +1) and (a = 1,0 < B < 1). The contours in this group 
begin at zero and go to infinity as p approaches 7/2. They approach the 
point z = 0 at different angles, depending on the quantity k, and they have 
the form I; if k = 1, the form T% if 0 < k < 1, and the form T; if —1 < k < 0. 
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3. œa > 1. Here the contours I begin at zero, which they approach at the 
angle 1/2, and go to infinity as p approaches —76/2, i.e., they have the form 
Faf 6>0, orl; tf p< 0. 

4. (a < 1, 8 = 1) and (a= 1, 8 = 1). The contours Ig in this group go to 
infinity as p approaches 7/2, and they begin at the point —77, where 

Pa l (alp Va Fazi, 
~ | exp(—z-1) ifa=1. 

With the help of the contour I'g we form a new contour S by following the 
rule that S =Te if 841, and S =TgNI™ if 8 = 1, where [* = {z:Rez = 0, 
-r < Imz < 0}. A feature of the contour [* is that 

Im(izz + w(z,a,-1))=0 forzel”, 
which clearly implies the equality 


1 
= Re | (izz + w(z,a,—-1))dz=0. 
T p» 
Therefore, in the case 8 = 1 
$ Re | explizz + %)dz = > Re | explizx + w) dz. 
n S n T 


Lemma 2.2.3 makes it possible (by the last equality) to replace the contour 
of integration in (2.2.20) by T, and we see that 


g(—2,a,—B) = E Re | exp(zzz + w) dz 


1 
= [ exp{Re(izz + w)} d(Re z) 


1 
- ~ | exp(-W(e)) dlr cos), (2.2.24) 
À 
where 
_ f zrsinp +r®cosa(p + 7/2), a #1, 
ees zrsing + Brlogrsing + (7/2+ By)rcosy, a=1. eee) 


We obtain the final form W (p) by substituting in (2.2.25) the expression 
found for r(y) in (2.2.23). 
Suppose that a # 1. According to (2.2.23), 


sr = pon alp + 70/2) 
cos p 
and, therefore, 
sin a(y + 70/2) 
cos p 
y? cos|(a — 1)p + tað/2) _ 
Cos p 


W(p)= r° sinp + r™cosa(y + 76/2) 


r/e- U (2/7, 8). 
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If a = 1, then the equality Glogr = —x + (By + 7/2) tan follows from 
(2.2.22), and this equality in (2.2.25) gives us finally that 


W(p) = rrsinp + (—z + (Be + 2/2) tany)rsiny 
+ (By + T/2)r cosp 
= r(cos p + sin? p/ cosy) (By + T/2) 
= exp(—2/8)U1 (29/7, B). 


To conclude the proof we must clear up the form of the differential d(r cos p). 
Let us consider the case a # 1. By (2.2.22), zrcosy = r®sina(y + 70/2). 
Consequently, 


rd(r cos p) = ar“ cosa(p + 70/2) dp + ar?! sin a(y + 70/2)dr 
= ar“ cosa(y + 70/2) dp + azr cos yr tdr 
= ard(r cos p) + a|zr sin p + r* cosa(p + 70/2)|dy. 


As is clear from (2.2.25), the second term in the last sum is equal to aW (p) dy. 
Thus, 


rd(r cos p) = ard(r cos p) + ar®/(°-YU, (p/m, Ody, 


which implies the desired form of the differential d(r cos p): 


dir cosp) = i (DU, (29/1, 8)dy. (2.2.26) 


The case a = 1 is handled similarly. There the result of the computations is 
the equality 


d(r cosp) = W (p)dp/B = (1/8) exp(—2/8)U1 (29/7, 0)dp. 


To conclude the proof it remains for us to substitute the expressions W (p) 
and d(r cos p) into (2.2.24) and consider that, as we go along the contour I in 
_ the direction of increasing r, the angle p changes from kr/2 to 1/2 if a < 1, 
and from 7/2 to 70/2 if a > 1. 

| Consequently, if a Æ 1, then (2.2.24) can be written in the form 


acela) 
t(l- a) 


n/2 
g(—z,a,—3) = / r! -1)y (2/7, 0) 


6/2 
x exp{—2°/(- DU, (2p/m, 6)} dy, 


and if a = 1, then in the form 


nm /2 
g{—2#,1,—8) = zf : U, (20/7, 8) exp{—e7® PU (2, 7, B)} dy. 


—1/2 
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The proof was carried out for densities g(—z,a,—) under the additional 
conditions z > 0 in the case a 4 1 and 8 < 0 in the case a = 1. The general 
case can be reduced to the cases stipulated by these conditions. Indeed, 


g(x, a, b) = 9(—|a1,;4,.-2°); 
g(x, 1, 8) = g(—z*,1 — |2|), 


where 3* = Bsgn z and z* = xsgn 8. If we substitute the quantities |z| and 
B* (in the case a Æ 1) and the quantities z* and || (in the case a = 1) in 
the expressions found for the densities, then we get the respective equalities 
(2.2.18) and (2.2.19) after replacing the variable p by mø/2. In the last case 
it must also be taken into account that 


-z*/|B|=—2z/B and Uy(y,8) = Ui(—9, -8). 


We make some remarks about the theorem just proved. 


REMARK 1. The distribution functions G(x, a, 3) of standard stable dis- 
tributions can be written in the following form: 


l. 


G(z, a, 3) = C(a, 0) + a [ exp{—2°/(*-D U7, (p, 0)} dy (2.2.27) 
0 


iai andz>0, 
2. 


1 
G(z,1,8) = 5 J  exp(=e™7 901 (¢,8)) do (2.2.28) 


if a= Land f >00, 


The cases a # 1, z < 0 and a = 1, 8 < 0 can be reduced to the corre- 
sponding cases a Æ 1, z > 0 and a = 1, 8 > 0 with the help of the following 
equality, which is a consequence of Property 2.2 and is valid for the forms (B) 
and (A): for any real x and any admissible parameters a and 8 


G(—z,a, 3) + G(z,a,—-8) = 1. (2.2.29) 


The proof of (2.2.27) and (2.2.28) reduces to integration of the correspond- 
ing inequalities (2.2.18) and (2.2.19). For example, consider the case a # 1, 


§2.2. REPRESENTATION BY INTEGRALS 79 


x >0. Then 
1 — G(z, a, B) = J g(u, a, 3) du 


1 foe) 
= a | de | (- a gle ) 
2 =ø r a-—l 


== [. 5 +e(a)) — expla taD) dip 


= ta +0)(1+ela))— = D exp(—2°/(°-D U7, ) dp. 


The case œ = 1 is analyzed similarly. 

REMARK 2. Although (2.2.27) was proved for the values z > 0, there is 
nothing to prevent us from passing to the limit as x — 0 and computing 
the value of G(0, œ, 8). Passing to the limit under the integral sign does not 
require any additional justification. As a result, in the case a Æ 1 


G(0, a, 8) = 5(1— BK(a)/a). (2.2.30) 


Unfortunately, in the case a = 1, 8 > 0 no values of a are known such 
that G(a, 1, 8) can be expressed as a combination of elementary functions of 
B. However, the expression 


1 
G(0,1,8) = aa J Uile, P)exp(-U1 (e 6)) do (2.2.31) 


obtained from (2.2.28) also turns out to be useful in the problem of statistical 
estimation of the parameter 8. 
REMARK 3. If a < 1 and 8 = 1, then for all z < 0 


G(z,a,1) = 0. (2.2.32) 


Since G(z,a,1) is a nondecreasing function tending to zero as r > —00, 
_ (2.2.32) is a consequence of the equality G(0, a, 1) = 0 (see (2.2.30)) (26). In 
turn, (2.2.29) and (2.2.32) imply that in the case when a < 1 and 83 = -1 


G(z,a,—-1)=1 (2.2.33) 


for all x > 0. 

REMARK 4. The forms (2.2.18) and (2.2.19) of the densities of standard 
distributions, together with properties (2.2.32) and (2.2.33), enable us to draw 
the following conclusion. 
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The probability measures of standard stable laws are concentrated on the 
following intervals: 

a) the positive semi-axis if a < 1 and 2 = 1; 

b) the negative semi-axis if a < 1 and 8 = —1; 

c) the whole real line in the other cases. 

Furthermore, the density g(x, a, 3) is positive at each interior point xz of 
the interval on which the probability measure is concentrated. 

REMARK 5. We get an unusual way of writing a normal distribution in 
the case a = 2 from (2.2.18) and (2.2.27). (Recall that a standard stable 
oda with a = 2 corresponds to the unbiased normal law with variance 
a? = 2: see the beginning of §2.2.) a 


NE 8 
27 x 2 sin? p sin? p 


z. et /2 dt = a ex (- i ) a t>0 
V2r J- T Jo . 2sin? p í 


The equalities (2.2.18) and (2.2.19) obviously make it possible to represent 
derivatives g'”) (x, a, 8) of any order n in analogous form. These representa- 
tions, like the representations of the densities themselves, are far from unique. 
The fact is that the method used to prove Theorem 2.2.3 can be applied to 
the derivatives obtained from (2.2.20): 


™) (x, a, 8) = ~ Re | exp(izr + Y(z, a, —9)) dz 


i.e., it is possible to replace the integration over the semi-axis z > 0 by 
integration over the contour I’, and so on. As a result, 


g™ (x, a, B) = - > exp(—W())r” (r sin(n + 1) (2 T 5) 


+rcos(n + 1) ( + =)) dy. (2.2.34) 


It turns out that this representation has a rarely expressed individuality and 
differs essentially from what is obtained by n-fold differentiation of (2.2.18) 
and (2.2.19). We illustrate this by an example. 
= 1, > 0, and let 


gi/(a-1) fal, 
ig} = l 
exp(—z/3) ifa=1, 


—— a= i Q 5 
a(y) = sin Y sin ay 
exp(y cot p) iesi 


sin Y 
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B |sinay|*/"-™ /asin(2 — a)y 
bp)=4 1-a| sing ( sin ap -1) ERR 
exp(—2 cot p)(2p cot p — 1) if a= I, 
T 5" tazi, f= 1. 
0 otherwise, 
0 ifa <1, @=-1, 
v= | -nga if 8 > 1, 
T otherwise. 


With the help of (2.2.23) the equality (2.2.34) can be transformed into 


g'(z,a, 6) = =K°(2) | “Helepi-hila@nde (2.2.35) 


At the same time, differentiation of (2.2.18) and (2.2.19) gives us that, in the 
same notation, 


v 


(2, 4,8) = ok! f (1 — ak*a)aexp(—k%a) dy, 


u 
where c = a/{r|1 — a|} if a £ 1, and c = 1/r ifa = 1. 

Even a quick glance shows how different these representations of g'(x,&, 3) 
are. In the case a # | the representation (2.2.35) was first considered in [34], 
where it was used in studying the question of unimodality of stable laws. The 
main result of that paper is presented below (Theorem 2.7.5). 

The equality (2.2.35), in turn, can serve as a starting point for obtaining 
new expressions for the function g(x, œ, 3). However, as a rule, the formulas 
thus obtained turn out to be essentially more complicated than those given 
in Theorem 2.2.3. They have the simplest form in the case when a = 1/n, 
n=1,2,.... For example, 


otz. 1,1). = J ba~*(1 + ka) exp(—ka) dy, 
0 
gegi) = i f ba™t (ar73? + 3a?! + 6ax~!/? + 6)exp(—r7 12a) dy. 
0 


It is not yet clear how such representations might turn out to be useful in 
studying the properties of stable distributions. 


§2.3. The duality law in the class 
of strictly stable distributions 


This is the name for the relation connecting the distributions with pa- 
rameter œ > 1 and the distributions with parameters a’ = 1/a in the 
class W. 
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It should be said that this relation is not peculiar just to stable distribu- 
tions. There is an analogue of it (more precisely, of the part of it connecting 
the extremal stable distributions) in the set of infinitely divisible laws. The 
duality law proves to be a valuable addition to the elementary properties of 
stable distributions. Thanks to it, we can better form an idea of the structure 
of these distributions. 

The presentation of specific properties of strictly stable distributions (the 
duality law is included in these properties) is commited to the next chapter. 
However, for reasons of a methodical nature we shall become acquainted with 
the duality law in this chapter. In the course of the present section we do not 
have to go outside the class W, so it is natural to parametrize stable laws in 
the form (C). It will be indicated explicitly when we have to use other forms. 

Without loss of generality we can consider only part of the distributions 
in W by fixing the value of the scale parameter: let Ac = 1, which corre- 
sponds to the densities gco(z, a, 9) and distribution functions Go(z, a,@) with 
characteristic functions of the form 


go(t, a,0) = exp(—|t|* exp(—7z7a0/2)), 
where 0 < a < 2 and |0| < min(1,2/a — 1). 


THEOREM 2.3.1. For any pairs of admissible parameters a > 1, 0 and 
any x > 0 


a(l — Go(z, a, 0)) = Go(z~*, a’, 6’) — $(1 — 0’) 
= Gco(z~%, a’, &’) — Go (0, a’, 0’), (2.3.1) 
where the parameters a’, 6’ are concerned with a,@ by the equalities 
a = Ha, 1+ 6 =a(1+ 4). (2.3.2) 
In terms of densities, (2.3.1) is equivalent to the equality (%7) 
gco(z, a, 0) = 2 1~%go(z~*, a, 0’). (2.3.3) 


The proof of the theorem is based on (2.2.27), which in the form (C) has the 
same form as in the form (B), because Gg(z,a,3) = Go(z,a,@). Consider 
the case a > 1; then 


1 
a(l — Gc(z,a,0)) = al exp{—2%/(*-D I, (p, 6)} dy. (2.3.4) 
0 


The right-hand side of (2.3.1) can be given an analogous form: 
Go(x-%, a’, 6’) — Go (0, 0’, 6’) 


1 
=3/ exp{—(2-9)°/" ua (p, 6} de. (2.3.5) 
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To establish (2.3.1) it suffices to transform the right-hand side of (2.3.4) in 
such a way that it takes the form of the right-hand side of of (2.3.5). Since 
a’ = 1/a, it follows that ~ 


o/(a-1) Es (aye fee — 1). 


Further, 


Uale,8) = ( 


i e lee cos $[(a — 1)p + ab] 
7 sin alp +0) — 


sin Fa(yp + ~~ cos 5 [(a — 1)p + a0] 


us 
cos 5 cos 5 


cos 5 
We perform a change of variable, setting 
a(o+0)=1-Kk, ie, p= -a’kK+a/ —8. 
With this substitution the limits of integration become 1 and 1 — a(1 +6) = 
—6’. Moreover, we get that 
(a-—1l)p+a0=(a-1)p+1-—K-ap=1-K-o 
=1-k+a'k-— a +0 = (æ —-1)k+a'’, 
p =-a'k +1- a'h =1- a(k +0). 
Substituting these expressions in (2.3.4) and considering that 1/(1 — a) = 
—a’/(1 — a’), we obtain the equality 
Uap, 9) = Ua: (k, 0’). 
As a result, (2.3.4) is transformed into (2.3.5). 

The case a = 1 need not be considered separately, since the validity of 
(2.3.1) follows from the joint continuity of the distributions in W with respect 
to a and @ in the whole domain of admissible values of these parameters (a 
property mentioned in the Introduction). Thus, it suffices for us to pass to 
the limit as a —> 1 in (2.3.1). 


In the case a = 1, 0 we have that a’ = 1 and 0’ = @. This case corresponds 


to the Cauchy distribution with linearly transformed argument. Namely, 
=i 


o1 T . 7 2 
jola 1,2) = = COS 79 (1 2z sin gf tt ) i 
1 1 2-sinZ0 
=-+-— — }. 2:8:0 
Gc(z, 1,9) 5 + arctan ( TAT, ) (2.3.5a) 


We now demonstrate the possibilities of using the duality law in the prob- 
lem of constructing integral representations for densities of complex argu- 
ments. The representation to be proved below relates to stable laws with 
parameters a > 1 and serves as a complement to the representations (2.2.12) 
and (2.2.13) in Theorem 2.2.2. 
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THEOREM 2.3.2. Suppose that a > 1, B is a pair of admissible parame- 
ters corresponding to the form (A), and x 1s a real number. Let 
B* = cot(m/2a) - cot[tQ(a, 3)/2al, a’ = 1/a, 
Q(a, B) = 1 — 2 arctan( tan(7a/2)), 
D = [1 + (8 tan(ma/2))?}!/?9(sin(mQ(a, B)/2a))~*. 
If y = z + |x|!7% B* tan(ma’/2), then 
Dy~*~/"g4(Dy~"/%, a, B) = galy, a’, 8). (2.3.6) 


PROOF. The equality (2.3.6) is verified by successive transformation of its 
left-hand (or right-hand) side according to the scheme 


gala, :) = gB(-,a,:) > go(-,a,-) = gol, a’, -) 


> ggl, œ,- )  ga(-,a’,-). (2.3.7) 
The middle part of this chain of transformations is the duality law, while 
the remaining links involve transition from one form to another. Therefore, 
in content (2.3.6) (if we abstract from the special form of y and regard it 
as an independent variable) is just an expression of the duality law in the 
form (A). It will be more convenient for us to go through the chain (2.3.7) of 
transformations from the end to the beginning. 
The first step is a transformation from the form (A) to (B): 


ga(y, a’, B*) = ga(y, a’, 88,0, Ab). 


Here the parameters 3% and A‘, are connected with the parameters a’ and 
B* by the equalities (see (1.19)) 


tan(7a’ 33/2) = B* tan(ma’/2), Ag cos(ma’/BR/2) = 1. (2.3.8) 
Further, by Property 2.1, 
gB(y, 0’, 85,0, àp) = (AB) /“g((AB)'/%y, a', BB) 


= (AB) “gc ( (àp) *y, 0", 0*), 
where 0* = ph, 
According to the duality law (2.3.3), the last function is equal to 


(AB) ABY e) t 990 (‘gy /%, a0), (2.3.9) 
where the parameter @ is connected with 6* by 
1+6* =a(1+4@). (2.3.10) 


The next step in (2.3.9)—a transition to the form (B)—does not change the 
form of the function, since 


go(Xg,y7 /%, 0,0) = ga(Apy'/%, a, Bp), 
where 0 = K(a)Gg/a = (1 — 2/a)Bp. 
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It then remains for us to pass to the form (A). We have 
Ney *~"/*gp(Ngy'/*, a, Bp) 
= Apy g4(Ney7!/%, a, BB,0, AA) 
= (Neda )y ga (Apa yT", a, Ba), 
In these equalities the parameters are connected by the relations (34 = 3) 
b tan(ra/2) = tan[r(a — 2)8g /2], 


(2.3.11) 
A4 = cosfr(a — 2)8g/2]. 


Thus, we have arrived at (2.3.6), in which D = Moon, But it remains to 
determine how the quantities G* and D depend on the original parameters 
œ > l and 2. This is easy to do by using formulas (2.3.8), (2.3.10), and 
(2.3.11) connecting the parameters. By (2.3.11), 


As (1 — tan?°(r(a — 2)8p/2))~1/? 
= (1+ (Btan(ma/2))*)~?/2. (2.3.12) 
According to (2.3.10), 
bB =a—1+(a-2)6g =a -1+ 2 arctan(8 tan(7a/2)). 


Therefore, 
a' bg = 1 — d'Q(a, p), 
tan(ta' bh/2) = cot[tQ(a, 8)/2a]. 


From (2.3.13) and (2.3.8) we find expressions for the parameters \’, and 8*: 


(2.3.13) 


b = (1+ tan?(ma’ bh /2))'/? 
= [1 + cot? (TQ(a, 8)/2a)]'/? = [sin(wQ(a, B)/2a)|~?, 
B* = cot(1/2a) cot(mQ(a, B)/2a). 


REMARK 1. Let u* = ĝ* tan Fa’, y= 2+ |z|1-@'w*, and l = seny. The 
equalities (2.3.6) and (2.2.12) yield an integral expression for the density of 
the complex function when a > 1 in the case not considered in Theorem 2.2.2: 


Dy~*~*/%g4(Dy—1/*, a, B) 


1 i , 
= — im f exp firs — |a|—1/9((-iu)1/@ 
m\r| Jo |z| 


— ip ul (iu a — ni} du. 
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REMARK 2. We consider a feature of the duality law which is connected 
with the presence of the condition z > 0 in the formulation of the law. On 
the face of it, this condition is not essential, since the case xz < 0 can be 
reduced to the case x > 0 by changing the sign of 8, i.e., due to the property 
g(x, a, 3) = g(—x, a, —G), and indeed, the relation 


g(x, a, B) = 2~*~“g(z-*, a’, 9’), a> l, >00, 
is complemented by an analogous relation for z < 0: 
g(x, a, 3) = g(|z|, œ, —B) = |z\~*~%9(|z|~*, a’, —8") 
= |e g(a, 8"), p= p'-2(1-a). 


Since 3’ # B” if a > 1, in this case the “intermediaries” of the parts of the 
distribution g(-,a’,-) (i.e., gı = g(-,a,-) and g2 = g(-,a,-), a’ = 1/a) are 
not component parts of some single distribution g(-,a,-). Therefore, there is 
no relation analogous to the duality law between the functions g(z, a, 3, y, À) 
and g(x %, a’, 3’, 7’, A’) in the case when z > 0, a > 1, and y Æ 0, i.e., the 
class W is the natural part of the family G in which the duality law can act. 
Furthermore, it is the “halves” g(z,a,3), £ > 0, of the densities of stable 
laws that appear as independent analytic objects, and not the densities as a 
whole. More precisely, it is not the “halves” themselves of the densities but 
the functions 
Solz, a, p) = rg9c(z, a, 8) = xg(z, a, p), 

if we consider them on the semi-axis zr > 0. For it is they that are the 
natural analytic entities for studying the group of properties of strictly stable 
laws, which properties include the duality law. Let us see how much more 
symmetric this law appears if it is written in terms of the functions ¢c. 

For any x > 0 and any admissible parameter values a, p and a’, p' connected 
by the relations aa’ = 1 and Vap = Va'p' we have the equality 

$c (2=,a,p) = $c (a) 

The group of properties connected with the “halves” of the densities or 
with the functions ¢c turns out to be so extensive that we allot them the 
main part of the next chapter. 

REMARK 3. The duality property turns out to be a very convenient in- 
strument for solving a number of problems such as, for instance, the problem 
of representing densities or distribution functions of the class G by integrals, 
by convergent or asymptotic series, and so on. For example, we already know 
that the natural intervals of the probability distributions for standard stable 
laws are the half-line (0, 00) in the case a < 1, 3 = 1, the half-line (—oo, 0) in 
the case a < 1, 8 = —1, and the whole real axis in the remaining cases. 
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One of the questions to be discussed later is the asymptotic behavior of the 
densities g(x, a, 3) and the distribution functions G(z, a, 3) on the boundaries 
of these intervals. The relations (2.3.1) and (2.3.2) shorten the amount of 
labor involved in studying the asymptotic behavior by almost half, since it is 
possible to confine oneself to analysis of the cases a < 1. In the case a > 1 the 
answer is obtained by an uncomplicated recomputation of results connected 
with the case a < 1. 


§2.4. The analytic structure of stable distributions 
and their representation by convergent series 


Stable laws have densities with uniformly bounded derivatives of any order. 
This fact was established in the Introduction. Moreover, the nth derivative 
of the density of a standard distribution has the estimate 


lg’ (x, a, B)| < a, (= * (cos [Fro] 


TQ 


The present section is devoted to explaining the more subtle analytic structure 
of stable distributions. It is convenient to carry out the analysis within the 
confines of the class Gg. Let us consider the set D of all possible pairs of 
admissible values of the parameters (a, B) except for the pair a = 1, 8 = 0. 
For each pair (a, 3) € D we define on the semi-axis z > 0 the functions 
_ | zolz a, 8) tazi, 
alz, a, p) = i ifa <1, 
G(z, a, B) if a > 1, 
Q(z, a, 8) = te —aG(z-'/*,a,8) ifa<1, 
which are connected by 
oQ (x, a, B) = q(z, a, 8). (2.4.1) 
THEOREM 2.4.1. For each pair (a, 3) € D the function Q(x, a, 3) extends 
analytically from the semi-azis x > 0 to the whole complex plane, i.e., is an 
entire analytic function. 


The derivative of an entire function is itself an entire function. Therefore, 
the product q(x, a, 3) = rQ’(z, a, 3) of two entire functions is also an entire 
function, and, moreover, it has a zero of first order at the point z = 0 ‘Gal? 

PROOF. Consider the case a = 1, 3 > 0. According to Corollary 1 to 
Theorem 2.2.1, for any real z the function 1 — Q can be represented as the 
following absolutely convergent integral: 


1 — Q(z, a, 8) = 1— G(z, a, B) 
= FM exp(—zu — ßu log u) sin (1 + Bus | 
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If in the entire function exp(—xu) appearing in the integrand we replace z 
by an arbitrary complex number z, the integral remains absolutely convergent. 
This means that the integral converges uniformly in a disk |z| < r of arbitrarily 
large radius r. Consequently, the integral and with it the function Q extend 
analytically to the whole z-plane. 

The case a < 1 is handled according to the same scheme. By Corollary 1 
to Theorem 2.2.1, 


Qiz. p =g- aG te, a, 3) 
Q du 


(00) 
= — e “Imexp(—ru%e '7?%) — 
T Jo u 


for any z > 0. The integral is absolutely convergent and remains so if z 
is replaced in the integrand by any complex number z. Consequently, it 
converges uniformly in a disk |z| < r of any radius, i.e., the function Q is 
entire. 

The case a > 1 reduces to a < 1 with use of (2.3.1): 


Q(z, a, B) = G(x, a, B) 
=1+4a'(1- 6’) - a'G(z~/ a’, B’) 
=1 -id (1 +p) + Q(z, a’, p), (2.4.2) 


where a’ = 1/a and p’ = —1 + a(1 + BK (a)/a). 

We make two remarks about this theorem. 

REMARK 1. The case a = 1, 3 = 0 corresponding to the Cauchy distribu- 
tion occupies a special position among the standard stable distributions. The 
distribution function in this case has the form 


G(z, 1,0) = : — arctan (2:) 


2 1 x—in/2 
~ 27 oni S rtin)?’ 
which shows that although G does extend analytically from the semi-axis 
x > 0, the extension is only to the half-plane with a cut (for example, along 
the segment [—i7/2,i72/2]) joining the branch points —im/2 and i7m/2 
REMARK 2. At the end of the preceding section we pointed out a feature of 
stable distributions consisting in the fact that the “halves” g(x, aœ, 3), x > 0, 
and g(z,a,3), x < 0, of the standard densities in the case a # 1 appear 
as independent analytic formations. Theorem 2.4.1 gives another intuitive 
confirmation. This is especially clear in the case a < 1. Indeed, consider the 
density g(x, a, 3) with some pair of parameter values a < 1 and 8. We can 
find the “half” g(z,a,3), x < 0, in symmetrically mapped form among the 
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“halves” g(x, a, B), x > 0, for B = —. Therefore, the functions q(x, a, 3) 
and q(z,a,—) form two parts of a single density. However, the analytic 
extensions of these functions are completely different and cannot be reduced 
one to the other by means of some simple transformation like a symmetric 
mapping. For example, if a < 1, then q(z,a, 1) has some nonzero extension 
to the complex plane, while g(z,a — 1) = 0, x > 0, has as an extension the 
function identically equal to zero in the whole plane. 
An analogous picture is observed in respect to the function Q. 


THEOREM 2.4.2. For any pair (a, 3) € D of parameter values the entire 
function Q(x, a, 3) can be represented as a power series as follows. If a < 1, 
then 


Q(z, a, 3) 2 ae i ,B) 


[(na+1) . 
ae 1 n 
=F C0 aria sin(mnpa)z”. (2.4.3) 
Ifa>1, then, with the same notation a’, 3’ as in (2.4.2), 
1 
Qna 8) = Gira 8) = 1+ po (1 + 6B") 
1 22 yn [(na/ + P(na’ + 1) 5 
+- D a aari sin(mnp)r”. (2.4.4) 
In the case when a = 1 and 8 > 0 
1 1 = n—i1 n 
Q(z, a, 8) = G(z,a, 8) = 1- =bo + = 2 (1 bna", (2.4.5) 
where 
ba = a i exp(—Gulogu)u™—! sin ja + B)u “| d u. 


By (2.2.29), the case a = 1, 3 < 0 can be reduced to the case a = 1, 3 > 0. 
The proof of (2.4.4)—(2.4.5) is based on expansion of the integrands in power 


_ series with respect to z and termwise integration of the representations of Q 
used in Theorem 2.4.1 in the case a > 1. The expansion (2.4.3) is obtained 


from (2.4.4) due to (2.4.2). 

REMARK. The equalities (2.4.1) and (2.4.3)-(2.4.5) enable us to get (by 
differentiation of the series) the following representations of the densities 
g(x, a, 3) by convergent series (2-9); 

If a > 1, then for any admissible 3 and any real z 


giz, a, p) = : D sin(mnp)r” t. (2.4.6) 


n=l 
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If a= 1 and ß > 0, then for any real z 


1 n-1 
— : 2.4.7 
g(x, 1, 8) = — yen (2.4.7) 
If a < 1, then for any admissible @ and any z > 0 
1 = n—1 T (na 7 1) ° —ħa—1 
g(z, a, B) = = dV Tin +1) sin(mnpa)xr (2.4.8) 


In contrast to (2.4.5), the equalities (2.4.3) and (2.4.4) have the feature 
that the power series on their right-hand sides contain explicit expressions for 
the coefficients. This enables us to compute the order and type of the entire 
function Q(z, a, 3) by using the following well-known connections between the 
order ø and type 6 of an entire function y(z) = }°>° anz” and its coefficients 
(see, for example, [83]): 


nlogn 


‘=e, 2.4.9) 
in SUP Fig lan | l 
1 
ô = — limsupnia,|?/". (2.4.10) 
Oe n= 


Let d=aifa<1,anda@=1/aifa>1. 


THEOREM 2.4.3. In the case a Æ 1 the entire function Q(z, aœ, 8) has 
order 
o=(1-a)"' 
and type 
6 =(1-a)q-V/0-%), 
In the case a = 1, B £0 the entire function Q(z, 1, 8) = G(z,1, 8) has infinite 
order (2-10). 


PROOF. For a Æ 1 we find with the help of Stirling’s formula that 
(na + 1)/P(n = 1) 
= exp { (à — 1)nlogn + n(aloga@ — à + 1) + 5 log à + o(1)}, 
and the values of o and 6 are found from (2.4.9) and (2.4.10) by using (2.4.3) 
and (2.4.4). 
It can be proved directly that G(z, 1,1) has infinite order by considering 


the asymptotic behavior of G(x, 1,1) on the negative semi-axis. According to 
Theorem 2.4.3, 


G(z,1,1) ~ £ 1 +2) = exp(=(1+ 2) } as tT > —O0. 


§2.4. ANALYTIC STRUCTURE AND SERIES REPRESENTATION 91 


It is known from the theory of analytic functions (see, for example, the 
well-known book [83]) that if an entire function w(z) has finite order ø, then 
for any fixed € > 0 there exist circles |z| = r of arbitrarily large radius r on 
which 

min{|w(z)|: |z| = r} > exp(—r7**). 

It is easy to see that the asymptotic behavior of G(z,1,1) as z — —o0 is 
incompatible with the assumption that it has finite order. 

We choose another route for proving that the entire function C551, 8), 
p # 0, has infinite order. Note first of all that without loss of generality we 
can assume that 6 > 0. According to Corollary 1 to Theorem 2.2.1, for any 
real x 


G(z,1,8)=1- =f exp(—ru — Gulog u) sin [Za + B)u] = 


It is not hard to see that the integral remains absolutely convergent if z 
is replaced by some complex number. This means that the integral gives 
a representation of G(z, 1,8) in the whole complex z-plane. Consider the 
sequence of complex numbers 


Zk = —Le — 15 (1+ B) = —B(1 + log ux) — iZ (1 + 8), 


where ux = (1+ 4k)/(1 + 8), k =1,2,.... 
Let w = (2(1 + 3))~'. Elementary calculations show that 


Im G(zx, 1, 3) 
a f o (zku — Bulog u) (sin fi +B)u) es 
2 
> = a exp(z,u — Bulog u) (sin 50 + B)u) = 
1 3 + 8k 
> r log (1) exp{Tp (up — w) — B(uk + w) log(uz + w)}. 


This quantity has the asymptotic expression 
= exp{bux — (1+ B)w log uk — 28w + o(1)} 
= exp{ßexp(zk/ß — 1) + O(zk)} 
as k — oo. Now let 
M(r) = max{|G(z, 1, 8)|:|2| = r} 


and note that 
M(|zk|) > Im G(zx, 1, 8) 
> exp{ßexp(5 Rez, — 1) + O(Re zx) }. 
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Since |z,| = Re z,(1+ 0(1)), the last inequality = us 
lim sup + log log M (r) > 3 > 0, 
r— oo 


which, in turn, implies that G(z, 1, 8) has infinite order. 

REMARK. As is known from the theory of analytic functions, any derivaitve 
of an entire function has the same order and type as the function itself. In 
particular, it follows from this that Theorem 2.4.3 is valid also for the function 


g(z, a, p). 
In concluding this section we give without proof a certain interesting fact. 
The reader desiring to see a proof is referred to [7], whence the fact was taken 


Taai t 


Consider a standard stable distribution with parameters a < 1 and 2 = 1, 
and let 


(s) Heti VPS, ose r(it+s+n)c* 

L (£) = | s~n (—1)* =n UOO 
T(n+1+s) = T(k+1I(n-k+1T(1+s+n) 
be the Laguerre polynomials (s > —1, n = 0,1,...) which form a complete 
orthonormal system in the Hilbert space of real functions on [0, 00) whose 


squares are integrable with respect to the measure p(dr) = r°e “dz. 


THEOREM 2.4.4. For any0 <a<1andz> 0 the function q(z, a, 1) 
can be represented by the convergent series 


q(x, a, 1) = zexp(— >» k) (aL (2), (2.4.11) 


where s is any fized number greater than —1, and 


ke (a)=a (res) 1/2 


>> —1)"T(1 s+ n) 
Tim + LP nae 
In the case a > 1, z > 0 the expansion in a convergent series of the function 


q(x, a, 1) = exp(—z) S Ki) (1/a) Li?) (2) 


n=0 
is obtained from (2.4.11) with the help of (2.3.3). 


§2.5. Asymptotic expansions of stable distributions 


As mentioned in Remark 4 after Theorem 2.2.3, the support of the measure 
of the distribution G(x, a, 3) is the semi-axis (0,00) if a < 1 and 8 = 1, the 
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semi-axis (—0o,0) if a < 0 and 8 = —1, and the whole real axis otherwise. 
The material in the preceding section does not enable us to determine the 
behavior of the entire functions G(z,a, 3) and g(z,a, 3) at the point z = 0 
to < tnor ata = co f1< a2 < 2 (this does not concern the case when 
a= 1 and 3 = 0), because these points of the complex plane are singular for 
them: namely, a branch point in the first case and an essential singularity in 
the second. Below we correct this deficiency of information by constructing 
asymptotic expansions of G and g in a neighborhood of the corresponding 
singular points. It turns out that the forms of the asymptotic expansions differ 
considerably, depending on whether or not we are dealing with an extremal 
distribution (i.e., 86 = +1) and on whether a < 1,a=1, ora > 1. 

In this connection the following six possible situations distinguish them- 
selves (because of Property 2.2, it suffices to consider the asymptotic expan- 
sions only for x > 0): 

La<1,841,2-0. 

Lagzi B= 1, 2s 0. 

Il a=1, 84-1, B40, « > oo. 

IV.a=1, p= x => oo. 

V.a>1,84#-1,2r—-o. 

VLa>1, @=-1, x— oo. 

The arrangement and grouping of the material in this section does not 
follow precisely the indicated division into six cases; hence, some explana- 
tions are necessary. The asymptotic expansions given below (the concept of 
an asymptotic series is the one traditionally accepted in mathematical anal- 
ysis) are divided into two groups. The first (Theorems 2.5.1-2.5.4) contains 
asymptotic formulas for the densities g(x, a, 3) and the distribution functions 
in neighborhoods of the corresponding singular points. The second group 
(Theorems 2.5.5-2.5.7) combines the asymptotic expansions for g(r(z), a, p) 
and G(r(z), a, 3), where the r(x) are functions of a special type. Furthermore, 
in the cases when the expansions of the distribution functions are obtained as 
consequences of the asymptotic expansions of the densities corresponding to 
them, the former expansions are not singled out in independent statements. 
The same principle is followed also when one of the cases mentioned above can 
be reduced to another by using duality relations (case V reduces to case I, and 
VI reduces to II). Cases II, IV, and VI can be combined in a single analytic 
form of expression, so they are considered in the two Theorems 2.5.2 and 2.5.3 
combining them, of which one has to do with expansion of the distribution 
functions. 

The case a = 1, 8 = 0, for which there are simple explicit expressions for 
the density and distribution function, is not considered at all. 
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THEOREM 2.5.1. Ifa<1and i i 1, then the asymptotic representation 


1 
glz, 0, 8) ~ — ee Meroe sin 5 (k + 1)(1 — B)x* (2.5.1) 


holds as x — 0. 


PROOF. Let us use the integral expression for the density in the form 
(2.2.1) and write 


g(z,a, 8) = = Re f exp (itz — t* exp (iZap)) dt 


ae are f (it)* exp (-t° exp (i508) ) dt + Rn, 
T k=0 k! 0 2 
where 
n—-1 _. 
= 1 = itz (itx)* a N 
l = Re | k — 2 i | exp (-t exp (‘5a8)) dt. 
We have that 
Ley T 
a = me n _4Q au 
|Rn| < =f mtz) exp ( t cos 5a) dt 
— 1 T((n+1)/a) m a)-(ntl)/a 
= ma T(n+1) [cos Zap] D (2.5.2) 


In the main sum in all the integrals we perform the change of variable w = 
texp(itĝß/2) and a subsequent rotation of the contour of integration in order 
that the integration again be carried out along the semi-axis (0,00). The 
expansion (2.5.1) is obtained as a result. 


COROLLARY 1. Ifaæa< 1 andß £1, then as t — 0 
1 
G(z, a, 2) ie G(0, a, 8) == G(x, a, ß) Ja 5 (1 _ b) 


SH a si in 5 k(1 — D)e k (2.5.3) 


This can be verified with the help of (2.5.1) and L’Hépital’s rule. 
COROLLARY 2. Ifa>1 and ß 4-1, then as z — 0 


g(z,a, 8) ~ —27! 5 i < Skl 2- a1 +8 "e. (2.5.4) 


Indeed, according to on fora >landzr>0 
gt, a, 8) aa foe, Bie 8), (2.5.5) 
where #” = —1+ a(1+ b(a — 2)/a) =1-—(2—a)(1+4 8). 
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We have the asymptotic expansion (2.5.1) for the right-hand side of (2.5.5). 
This implies (2.5.4). 
In turn, we find from (2.5.4) that if a > 1 and 8 Æ 1, then as x > œ% 


1 — G(z,a, b) ~ Eo, a sin = z k(2 - a(l +8) Āe. 
(This can be established with the help again of L’Hôpital’s rule.) 

The analysis of the asymptotic behavior of a stable distribution as z — 0 
in the case a < 1, 8 = 1 and as z > œ in the case a = 1, 8 = —1 is based on 
the well-known method of Laplace for asymptotic representation of integrals. 
The assertion taken as a basis for the analysis is one of the simplest variants 
of theorems of this kind. One can become acquainted with such theorems, for 
example, in the detailed monograph [20]. 

Let us consider even functions s(t) and w(t) that are analytic on the interval 
(—7,7) and have the following additional properties: 

1) w(t) is strictly monotone on (0,7). 

2) p= s(0) > 0, r = w(0) > 0, and o? = w” (0) > 0. 

3) s(t) = O(w(t)) ast > m. 

With the help of these functions we form the integral 


ig = = f s(t) exp{—Nw/(t)} dt, (2.5.6) 


which obviously exists for each N > 0. 


LEMMA 2.5.1. The integral Iy has the following representation by an 
asymptotic series as N — ov: 


In ~ aN v2 exp(—7TN) It D aN a ; (2.5.7) 


where 


Qao = J. qn (t) exp (-5) dt (2.5.8) 


and the functions q,(t), which are polynomials, are the coefficients of the 
series expansion, in powers of h? = (o*N)~!, of the even function 


w(t,h) = “sith exp { a (th) To a ! 


=1+ = Qn (t)h?”. (2.5.9) 
j=l 
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PROOF. We break up the integral Iy into two parts, with € = N~!/°: 
1 1 
In = — + | = Iy + In. 
2T lil<e 2T Ee<|t|<r 


The second term can be estimated by using properties 1)—3) of the functions 
s(t) and w(t) and the fact that zexp(—z) is monotonically decreasing on the 
semi-axis x > 1. For sufficiently large N we have (c, c’ and c” are constants) 


In < e w(t) exp(—Nw/(t)) dt 
> 
< mew(€) exp(—Nw/(e)) < c exp(—Nw/(e)). 
Since w(£) = 7 + o%e?/2 + o(et), it follows that 
Ih < c" exp(—1N — 0? N13 /2). 
Consequently, this part of the integral Iy is infinitesimally small as N — oo 


in comparison to any term of the asymptotic expansion (2.5.7). 
In the first term, after transforming it to the form 
N-1/2 


e/h 
T _ —t?/2 
In a exp( Nj e w(t, h) dt, 


we expand w(t, h) in an asymptotic series (2.5.9) of powers of h? and integrate 
termwise. It is then verified that the increase in the domain of integration of 
the integrals 

e/h 

f anlt)exp(—t/2) a 

—e/h 
to integration over the whole real line changes each of them by no more than 
O(exp(—N"/4)). 

We consider the case a < 1, 8 = 1 for z — 0 and the case a > 1, 8 = —1 
for x — oo. It turns out that Theorem 2.2.3 and Remark 1 after it make it 
possible to write g(x, a, 3) and G(z,a, 8) in the form of the integral (2.5.6). 
Let 


eee |1 —al(z/a)*/(°-) ifa £1, 
exp(zx — 1) itm i, 
. a/(a—-1) .. 
|= 
(=) sin( ap foc, 
w = w(p, a) = asin p (1 — a)sin p 
exp(1 — pcot p) ta=1 


sin Y 
(in the case a > 1 the function w = w(y, a) is defined with the help of the 
function w defined forf œ < 1 by replacing a by 1/a), and 


y= v(a)= {abv ifa #1, 
1 tai 
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It is not hard to see that with this notation the equalities (2.2.18) and (2.2.19) 
can be reduced in the cases a < 1, 8 = 1 and a = 1, 8 = —1 to the single 
equality 
Ae 1/a 7 
a(z,a,8)= Zee f wexp(—€w) de. (2.5.10) 


The fact that this formula remains true also when a > 1 is quite easy to 
check by using the daulity relation (2.3.3) and the following properties of the 
function £: 

If a > 1, then for anyx>0 


E(x “,1/a) = €(z, a), 
y(1/a)E*(z~%, 1/a)z-“UF®) = v(a)é/%(z, a). 

Similarly, (2.2.27), (2.2.28) and the duality relation (2.3.1) can be used to 
write the distribution functions in a unified form in the cases a < 1, 3 = 1 
and a > 1, 8 = —1. Namely, 

= l. exp(—f£w) dp = Pees -1) a i i (25.11) 

For convenience in presenting the subsequent material we introduce the 

quantity 


Os = i oah (2.5.12) 


l/a ifa>1. 
The function €(z,@) converges to oo in the case a < 1, 8 = 1 as z — 0 or 
in the case a > 1, 8 = —1 as z > œ, while w(y, a) (as is immediately clear 
from its definition) is an even analytic function on (—7,7) for which 


q w(0, a) = 1, f= w (0, a) =a, > 0. 


Since w(p, œ) differs from the function a(y) in (2.2.35) by only a factor, it 
follows from Lemma 2.7.5 that w is strictly monotone on (0,7). 

This means that w satisfies conditions 1) and 2) in Lemma 2.5.1. The fact 
that 3) also holds is obvious, because the role of s(t) is played in (2.5.10) 
and (2.5.11) by either w(t,a) or the function s(t) = 1 < w(t,a). Thus, 
Lemma 2.5.1 can be applied to the integrals (2.5.10) and (2.5.11). For the final 
formation of the asymptotic expansions of interest to us we take a detailed 
look at the expansion (2.5.9), on which the form of the coefficients Qn in 
(2.5.7) depends. 

Let us first write out the power series expansion for log w(y, a) in the case 
a < 1 by using the known expansions for the functions log(sin z/x) and zx cot x 
(see 1.411 and 1.518 in [27]). We have that 


log w(p, a) = X` an(a)y", (2.5.13) 
ol | 
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where 2ni Bon} Ta(1 — a2") 
uala) = memi] a = Ne (1 — a) | 


are polynomials of degree 2n — 1 with constant term equal to zero (the Bn 
are the Bernoulli numbers). From this we get by the Faa di Bruno formula 
that if a < 1, then 


9 (ee) 
_ p 2n 
w(p,a)=1+ a-y + 3 bnla) p”, (2.5.14) 


where i 
by, (x) = ion (Lar, 2a2, img le) 


D ere (tt) E 


ky + 2ko + +++ +nkn =n, ky > 0} 


and 


are the so-called Bell polynomials (see {70]). It is easy to see that the b,,(a) 
are polynomials of degree 2n — 1 with constant terms zero. 
It follows from (2.5.14) that for a > 1 


2 fore) 

p 2n 

=1 Ea Dn * ; 
wlp,a)=1 +a tà (as) p 


where b,,(@,) is a rational function of a. 
Thus, if s(y) = w(p) = w(w, a), then (2.5.9) has the form 


1 Oy 
w(yp, t) = exp {log w(ypt, a) — -y w(t, a) = p = pt? 


t 
= exp (Sa (p,a a)n) =1+ > talp a)t?”, (2.5.15) 
=l n=1 
where 
dn(p, a) = (an(as) — Y7bn41(a%)/as)?" 
and 


Qn(~, a) = Cn(1!d;, 2!do,...,n!d,,)/n! 
are polynomials of degree 2(n +1) in the variable p and polynomials of degree 
2N M Qy- 


But if s(p) = 1 and w(p) = ,&), then 
242 
w(p, t) = ex fest te w(t, a) — 1 — sern 
=1+ > gpa, (2.5.16) 
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where gn(, @) is obtained from qn(p, a) by setting ay = ag = --- = 0 in 
(2.5.15). The g,(y,a@) are thus polynomials of degree 2n in the variable a.. 


THEOREM 2.5.2. Suppose thata <1, 8 = 1, and z —> 0, or that a > 1, 
B = —1, and z — œ. Then 


V —Qa Q 
g(x, a, b) sie Tat )/2 


x exp(—€) ( +> Qna Ja) (25.17) 
n=l 

The cofficients Qn are given by (2.5.8) with the functions qn in (2.5.15), and 

are polynomials of degree 2n in the variable a, 


The proof of the theorem reduces to the application of Lemma 2.5.1 to 
the integral expression (2.5.10) for the density. Furthermore, it is clearly 
necessary to take into account the information contained in (2.5.15). 

REMARK. In the formulation of the theorem the variable £(z, a) was used 
both to obtain a concise expression and to demonstrate a single form of the 
asymptotic expansion in quite strongly divergent situations. For convenience 
in using (2.5.17) we write out its principal term (%12): 


2- 2(a-1 
Vy ¢(2—a)/206~€ = (x/a)! a)/ (a ) 


27a V 27a\1 — al 


a/(a—1) 
x exp { -l1 -al (2) (2.5.18) 


in the case a Æ 1, and 


ett Moet m = are Cy E a (2.5.19) 
in the case a = 1. 

If as a starting point we now take the representation (2.5.11) for the dis- 
tribution functions of the stable laws under consideration, then Lemma 2.5.1 
together with the expansion (2.5.16) leads us to the following assertion. 


THEOREM 2.5.3. Suppose that a <1, B = 1, and z —> 0, or that a > 1, 
B = —1, and x — œ. Then the asymptotic expansion of G(x, a, 3) in the case 
a < 1 or of 1—G(z,a, 3) in the case a > 1 has the form 


aa (: +) Qa(a.)(ae8)™) (2.5.20) 


n=1 


where the polynomials Qn, of degree 2n are given by (2.5.8), in which the 
functions gn(y, a) are determined by (2.5.16). 
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The principal term in (2.5.20) is transformed as follows if the function 
E(x, œ) is replaced by its expression: 


1 7 (ajaj e/a T a/(a—1) 
mne oT et A ee, ee. | 2:0.21 
V 270k V 27a\1 — a| P (7 (3) ( ) 


if a ~ 1, and 


-1 
: es = a exp -= — a) (2.5.22) 
T 


tasi, 

REMARK 1. The asymptotic expansion (2.5.20) can be differentiated with 
respect to z. We first obtain (2.5.17), and then asymptotic expansions for 
the derivatives of the density (with a corresponding sign change). The admis- 
sibility of this formal procedure follows from the form of the representation 
(2.5.11), which remains an integral of the type Jy in Lemma 2.5.1 after dif- 
ferentiation. It is useful to keep in view that (differentiate (2.5.20)) 


O£/Ozr = —e(a)v(a)€. 
In this way it is not hard to establish a connection between the polynomials 
Qn and Qn. Since the form of the polynomials does not change for any 


0 < a < 2, it follows that without loss of generality we can assume that 
a < 1. Differentiating (2.5.20), we have 


g’ 


C (a, a, 1) = e A- g= $ Qnn(aé)~” 
n>1 
+ (1 + se) 1+ X Qn(aé)~ 
n>1 
V 


z —— ¿(2-a)/2a o-€ 1+ iJ (Qn L 5 (an + 1)Qn-1) fae yo ™ 


27a n>] 


From this we get 
Q 


Qna) 5 (2n + 1)Qn—1(a) + Qn(a). 


Direct computation of the polynomials Q,, can be realized somewhat easier 
than for Qn, and thus the recursion relation between them proves to be useful. 
For example, a computation shows that 


Qi(a) = -4 (2 + Ta + 207). 
From this, considering that On = 1, we have 


Qi(a) = -4 (2 — 29a + 20°). 
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REMARK 2. The explicit expression for the polynomials a, shows that 
a have rational coefficients. The mechanism for forming the polynomials 
bn, Qn, and On preserves this property for them. It is interesting to note 
that the polynomials qn, bn, Qn, and On have symmetric coefficients. Indeed, 
simple transformations show that in a sufficiently small neighborhood of zero 


w(y,1/a) = w(y/a, a), 0< a =.2. 
Expanding both sides of this equality in series of powers of p, we get 
bn(1/a) = a *"b, (a), 
and, expanding the logarithms of both sides in series, we get 
An(1/a) = a~*"an(a). 


Consequently, the polynomials Q,, and aA formed from them have the same 
property. But this means that all these polynomials have symmetric coeffi- 
cients. 

The last of the cases being considered, namely, a = 1 and 8 ¥ —1, is very 
distinctive. 


THEOREM 2.5.4. Ifa=1 and-1 < 8 < 1, then the following asymptotic 
expansions are valid as x — ov: 


1 1, 
= 5 —P,(logz)z~"~! 2.5.2 
g(x,1, 8) ~ = rie m (log x)xr i ( 3) 
where P,(y) = J) i-ofint', and 
_ S a) fm m-lp(m-l) 
= -— $ 1 
rn = (p) (T) (1+ n) 
mi T n-m 3 T A , 
x B (F(.+ 8) sin 5 (n m); 
and 
1 — G(z,1, 8) ~ Eg (log x)a~" (2.5.24) 
(x, z Žo al g ; 5. 
where P*(y) = X i-oriyt, and the coefficients rè, differ from rin only in 


that the ues of the T -function at the point 1+ n appearing in them are 
replaced by the same derivatives at the point n. 
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PROOF.* In the representation (2.2.1b) of the density g(x, 1, 8) we change 
the contour of integration, replacing the nonnegative semi-axis by the com- 
pound contour L = Lı U L2, where 

Lı ={r=ut+iv:u=0, -A<v< 0}, 
Lz = {z =u +iv:u > 0, v=-—-A}, A= >O. 

The justification for replacing the contour of integration in (2.2.1b) by the 
contour L is elementary and reduces to a proof that the integral along the 


contour Ly = {z = u+ iv:u = N, —A < v < 0} tends to zero as N — oo. In 
summary 


1 as , T , 
as 1p) = = Re f exp(—itz — 3! — it logt) dt 


1 1 
Tref ==Re f +) = — Re(J, + I2). 
T T; T Li T is T 


Let us estimate the integral Jz. After the change t = u — tA of the variable 
of integration we find that 


i T 
|I2| < i exp (-A ae Bd(z, u)) du, 


< 


where d(z, u) = Im[(u — iz™ 1/4) log(u — ix !/4)]. Since |8| < 1 and |d(z, u) 
£714 + $a-1/4 log(u? + 21/2), it follows that for z > 16 


3 


> log(u? + D) du 


oo 
\I2| < exp(—23/4 + | exp (Fu + 
0 
= Cı exp(—2°/*), 


where C; is a constant. Let us now estimate J4. In this integral we make the 
change of variable t = —2v and get that 


A 
T v v v 
izli =j exp (-v+i30 Fa — b7 log = ) dv 


(COS (bso ot] aoe 


* Added in translation. There are some gaps in the proof of Theorem 2.5.4 in the 
Russian edition. The proof below is free of these defects. It was proposed to the author 
by A. Lisitskii. He pointed out that the asymptotic series (2.5.32) is actually a convergent 
series (at least for sufficiently large values of z). Note the great similarity between the 
proofs of Theorems 2.5.4 and 2.5.5, although they treat different asymptotic expansions. 
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where 
bac [ e7” Gii iza + B) — Blog “yn dv 


x wif a-o" opfe (1530 +8) -Blog >) } ae. 


x 
We estimate Ry: 
1 fA po, N41 por vi\N+1 v V 
e = ad be = jl ee 
Rol <a ff (Z) (0+0 + flow |)" exp (2 tog 2| - v) d 
A 
< aD ele | (x +logz + |logv|)Nt!e™”? dv 
-Jo 
< C2(N)(x~* log x)%*1, 


where C2(N) is a constant depending only on N. 
Next, we consider the integral 


s>] (oy iS - Blog”) e~” 
I} =| (=) ELETI Blog =| e "dv 
and estimate it for z > k +2: 
oo k 
[I3| < | (=) (x + logv + log x)*e~” dv 
A L 
< (m + 2)* (log aat i 
A U 
< [(m + 2)z~! log z] AF t1e—4 
< C3(k) exp(x?/4/2). 
Combination of the estimates obtained for the integrals I2, I3, and Ry now 


gives us that for any integer N > 1 
N k-1 fore) k 
g(x, 1, 8) = 2 Zr im f hiža + 8) — Blog =| ve” dv 
+ O(2%-? log’ +! x) 
as © > OO. 
The last step of the proof consists in a transformation of the integral which 
is the coefficient of the power z~*~!. We have that 


i sA r]k k =0 
im | [iS (1 + B) + Blog =| ve “du 


-D (i) S (20083)! (G+ 9)" san5- tetan 
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A further simple transformation of this sum, with the equality 


taken into account, leads us to the conclusion that it is a polynomial P, (log z) 
of the form indicated in the theorem. 

It is not hard to see that we are justified in integrating the asymptotic 
representation for the density from z to oo. This gives an asymptotic series 
for the function 1 — G(x, 1,8) as x — oo. 

A sequence of simple transformations shows that 


J P;(log y)y ~"? dy 


k 
= yo ' dayim | E (1+8)- - plog vee” dv 
= ¢~* P? (log z). 

This concludes the proof of the theorem. 


THEOREM 2.5.5. Ifa = 1 and—1 < 8 < 1, then the following asymptotic 
expansions are valid as x — œ (%13): 


1 Oo 
] r D 
g(x, ßlogxz, 1,8) ~ m Aia (2.5.25) 
where 
[(n—1)/2] 
1 n T 2m+1 
a. be and = n+m-—-1 bal 
n! 2 CY (ome) (G0 +9)) 
x e ia t ai E 1); 
1 — G(x + Blogz, 1,8) ~ 33 zid da + Edasi T 2.5.26 
g = = n—l1 T 3 ( EER ) 


where it is assumed that dp = 0. 


PROOF. By the inversion formula (2.2.1b), the density g(x, 1, 8), with the 
variable x replaced by x + Glog z, satisifes 


g(x + Blogz, 1, 8) 


1 
— Rel 
TI 


1 a a T oe 
Re | exp (i Fr (z — 1B log r) ) at (2.527) 
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We replace the integration along the semi-axis t > 0 in this integral by inte- 
gration along the contour L = Lı U Lo in the complex plane of z = o + ic, 
where 
L,={z:0=0,0<€<2}, L= {z:o > 0, &=2}. 

The justification for this change is elementary, and we omit it. The integral 
I in (2.5.27) after the sign for the real part can be writen as a sum J, + Ip 
of integrals, the first along Lı and the second along Lz. After the change of 
variable t = 7€ in the integral J; we get that 


I, =i [exp |--$ [iF +8) + pige] } dE 


For any € > 0 


IQ| < (7+ |logé|), — | ReQ| < €|log £l, 
and so, for any N > 1 and0 < <r, 


o(-2)-Ea(sye 


n=0 


A exp (+ Regl) < (2) m 


which implies the estimate 


x N 
|Rn| <f e§ (2) max (e3) desu”, 


where c is a numerical constant. Further, let 


lg = if e SQ" dE 
0 


wo (n\ (7 er REE 
=i A (30+) f e~ E (Blog €)” -1 E" dé 


An elementary computation shows that as r — oo 


if e £Q" dé = Il, + O(x" log” x exp(—z)). 
0 
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Consequently, as t — 00 


N-1 l n 
Reh = = j Reln (- 3 + O(a"), (2.5.28) 


m ie 
Let us estimate the second integral. Since 


Ip =J exp (iz - = (s - iB log z) ) dz 
[exw -2+i0- a (s - iBlog(o + ix)) | do 


= T 
=e * exp 4 izo — (o +7) (= — 7G log|(o + i)z] ) } do, 
J, e] (; )) 
it follows that 
|I2| < | exp { -50 — Bllogo +o arg(o + 1))} do. (2.5.29) 
0 


Combining (2.5.28) and (2.5.29), we get for any fixed integer N > 1 


N=1 n 
1 1 1 _N 
= — = — 25. 
g(x, 3 log x, 1, 8) = 2, 7 Rel, ( 3 +O") (2.5.30) 
as z — oo, and it remains only to check that (—1)” Rel, = dn. 

The asymptotic expansion (2.5.26) is obtained as a consequence of (2.5.25). 
Indeed, since 


1 — G(x + blogz, 1, 8) = fa + B/u)g(u + Blogu, 1,8)du, (2.5.31) 


substitution of the expression (2.5.30) for the density into (2.5.31) gives us 
the expansion (2.5.26) after the corresponding transformation. 

REMARK. The case 3 = —1 could have been formally included in the 
statement of the theorem, since all the arguments remain in force. However, in 
this case all the d, are 0, and the substance of the theorem’s assertion reduces 
to the claim that the functions g(x — log z, 1,—1) and 1 — G(z — log z, 1, —1) 
decrease more rapidly than any power function. 

Theorems 2.2.2 and 2.3.2 can be used to obtain another type of asymptotic 
expansion of stable distributions related to (2.5.25) and (2.5.26). 


THEOREM 2.5.6. The following asymptotic expansions are valid as x — 
o0 (uw = Btan(7a/2)): 
Ifa <1, then for any admissible 8B 


galz + pr'~*, a, b) ~ 


ale 


Yo An (Ea (2.5.32) 
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where 


ala ainaka 1 il be o 1)*; 


A P(ak+n—k+1) 
im T(k (=H 


+1)? (n-—k+1) 


1—Ga(r+ us 0, 8) 
Ty [An (a, B) + (1—a)pAn—1(a,B)|2~°", — (2.5.33) 


T 
n=1 


where Ao = 0 and An, n > 1, are the same as in (2.5.32). 


The proof of (2.5.32) is carried out by expanding the integrand of the 
corresponding representation of the density (2.2.12) in an asymptotic series. 
The asymptotic expansion (2.5.33) is obtained from the latter by multiplying 
by 1+ (1 — a)ua~° and integrating from z to co. The computation of the 
form of the coefficients An (a, 3) in these expansions is elementary and will 
not be given. 

REMARK. The series (2.5.32) is interpreted as asymptotic in the theorem. 
In fact, it can be shown to be convergent (what is more, absolutely convergent) 
for all sufficiently large z. 

Another expansion, though no longer at infinity but in a neighborhod of 
zero, is obtained with the help of (2.3.6) and (2.5.32). Let us use the notation 
in Theorem 2.3.2 (a = 1/a, y = t + WTI”, B= Br, ww’ = B' tanara /2), 
etc.). 


THEOREM 2.5.7. Ifa > 1, then the following asymptotic expansions hold 
as tT — œ for any admissible 2: 


1 (0.0) 
—1— i/ag —1/a = /\,,—1—n/a 
Dy A(Ky ,a, B) ~ - 2, Anl a Byz , (2.5.34) 
Ga(0, a, 8) = Ga(Dy~!/%, a, B) 
~ È [An (a, p’) + (1 a')u' Ani (a, B), 
TO por 


where the An(a, B) are the same as in (2.5.33). 
These relations are proved by direct comparison of (2.3.6) and the expan- 
sions (2.5.32) and (2.5.33). 
§2.6. Integral transformations of stable distributions 


For the information on the properties of the distributions in G gathered in 
the preceding sections of this chapter we are indebted first and foremost to the 


108 2. ANALYTIC PROPERTIES OF DISTRIBUTIONS IN G 


fact that the characteristic functions of these distributions have a simple form. 
Thus, the explicit form for the Fourier transforms of the densities of stable 
laws can to some extent compensate us for the absence of explicit expressions 
for the densities themselves. At the same time, the Fourier transformation is 
only one of many integral transformations known in function theory. And it is 
natural to expect that each integral transformation of the densities of stable 
distributions (or of some functions connected with them) that is obtainable in 
a sufficiently simple form can act as a source of interesting information about 
their properties (see Chapter 3). 

The material in the present section is divided into two parts. One con- 
tains several assertions to be proved. It precedes another part which carries 
the main idea of an algorithm for simplifying the analytic computations in a 
number of cases. The arguments connected with this algorithm bear a formal 
character because of the desire to isolate it in pure form, and further refine- 
ment and substantiation is required in each concrete case. (For example, it 
must be specified in what sense the integral defining the integral transform 
converges, or what the conditions are which allow changing the order of inte- 
gration, and so on.) 

Denote by D some subset of the real axis R, by J a subset of the com- 
plex plane C, and by f(x) and h(s,z) functions with domains D and I x D, 
respectively. Let us consider the integral 


(Hf)(s)= | h(s,x)f(x) dz, sel, (2.6.1) 

where the sets D and J and the function h called the kernel of the integral 

transformation ¥, are fixed, while f, which is chosen from some set §, is a 

function variable. The kernel h is chosen so that functions f € § can be 

reproduced in their domain D from the corresponding transforms (X f)(s), 
considered in the domain /.* 

We give some examples of well-known integral transformations with which 
we have been or shall be concerned. 

(a) The Fourier transformation (¥ = F) corresponds in (2.6.1) to the kernel 
h(s,x) = exp(zsx) and the sets D= I = R. 

(b) The two-sided Laplace transformation (¥ = N) is connected with the 
kernel of form h(s, xz) = exp(—sz) and with the sets D = R and I = {z:—c, < 
Rez < co}, where cı and c2 are nonnegative numbers. It can be said that 
this transformation is related to the transformation F, as is clear from the 
relation connecting them: 


(Nf)(-ts) =(Ff)(s), sER. 


*The very detailed monograph [82] can be recommended for learning about the well- 
developed theory of integral transformations with various types of kernels. 
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(c) The one-sided Laplace transformation (¥ = £). It corresponds to 
the kernel h(s,x) = exp(—sz) and the sets D = R+ = (0,00) and J = 
{z:—cı < Rez < co}, where cı and c2 are nonnegative numbers. Obviously, 
(N f)(s) = (Lf)(s) on the set of functions f equal to zero on the semi-axis 
t< 0. 

(d) The Mellin transformation (¥ = M). The sets D and I here are of the 
same form as in the one-sided Laplace transformation, but the kernel has the 
form A(s,z) = 2°. 

Instead of the kernel r*~' usually taken for the Mellin transformation (see 
[82]), we use the kernel x’. This inessential change is due to the desire to 
simplify the analytic expressions for the transforms (Mg)(s) of densities of 
stable laws. We could have kept the traditional choice of the kernel but then 
considered transformations of the functions zg. 

To these known transformations we add one more, which has so far found 
application only in probability theory and probabilistic number theory. 

(e) The characteristic transformation (4 = W) has as kernel the function 
h(s, xz) taking values in the set of 2 x 2 diagonal matrices 


_ [ hols, 2) 0 
Hi = ( 0 kad) i 


where hg(s, £) = |z|**(sgn x)*,k = 0,1, and 0% = 0 for any complex numbers 
s, while the sets are D = R and I = {z:—cı < Rez < c2}, where cı and c2 
are nonnegative numbers. 

If the function f in (2.6.1) is written in the form of the integral 


fiz) = [ h,(z,u)f.(u) du, xe D, (2.6.2) 


then the transformation (¥f)(s) gets an expression as a double integral. This 
kind of situation is encountered in considering integral transforms of functions 
of the form 


f(x) = U(x)g(r(x), a, 8,4, A). 


We have observed individual cases of such representations of functions by 
integrals in (2.2.8)—(2.2.10), (2.2.12), (2.2.13), (2.2.17), and (2.3.14). 

The original transform (¥f)(s) gets a simplified representation by single 
integrals in the cases when the integral ip h(s,r)h,(x, wu) dx can be computed 
in explicit form by regarding it as an integral transform with kernel h or h,. 
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We present some formal equalities that can serve as a starting point in the 
indicated direction for finding explicit or simplified expressions for transfor- 
mations ¥ of functions of the form 


f(x) = U(x)g(r(x), œ, B, Y, A). 


Let D = R and I = R. Using the inversion formulas (2.2.la) and (2.2.1b) 
for expressing the density g(x, a, 3), we find for functions of the form f(z) = 
lg = l(x)g(x, a, B) that 


(H1g)(s) = ~Re [ a(t, a, B)(Flh)(—t) dt. (2.6.3) 


0 

The representations (2.2.8) and (2.2.10) of the densities g(x, a, 3) can also 
be used, but for certain values of 8 they prove to be inconvenient for our 
purposes (complications automatically arise in justifying a change in the order 
of integration), and thus they require a preliminary change. Modifications of 
the equalities are obtained by rotating the contour of integration in (2.2.8) 
by the angle mp/2 and in (2.2.10) by the angle —7p/2. In all the cases with 
a # 1 the new variant of the expression for the density g(x, a, 3) is the same 
(however, in the case a < 1 it is nevertheless assumed that x > 0): 


1 T , ) 
glz, a,b) = = im f exp (—zue'*" = Ate eRe? St. ze) du. (2.6.4) 
0 


For D = R*+ we get from this the following formula for the transformation 
X of the function f(x) = I(r)g(z,a, 38), a #1: 


(H1g)(s) = — rim f exp (=u Rg POS 4. ze) (LIh)(ue'™?/?) du. (2.6.5) 


But if a = 1 and > 0, then to get an analogous formula we must use the 
representation (2.2.9) of g(x, 1,3), and we obtain 


(Hlg)(s) = : 5 exp(—ßu log u) sinfr(1 + 8)u/2)(Llh)(u) du. (2.6.6) 


In the case when D = R, a = 1, and 8 > 0, the same representation for 
the density leads to the formula 


(¥lg)(s) = S exp(— bulog u) sinfr(1 + G)u/2|(NIA)(u) du. (2.6.7) 


Let us consider the case D = R* and the functions f(x) = l(xr)g(z, a, 8) 
with densities g in the class W. Using the expression (2.2.18) for the density 
and writing 


w(s,r2) = I(z1-1/*)h(s, 2171/2), 
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we get that 
(Hlg)(s -if Ua (p, 0)(Lw)(Ua (p, 0)) de. (2.6.8) 
In the case of a compound argument r(x) of special form 


pHa ¢1-l/oB tan Ea ifa <1, 
a" = Glogs fe = i, 


the representations (2.2.12) and (2.2.13) for lga = U(x)ga(r(x), a, 3) lead to 
the following formula for a transformation ¥ with the set D = R+ (a <1, 


B > 0): 
(Higa )(s) = = smf e “(Ca!/*1h)(T) du, (2.6.9) 


where 
_ f —(iu)® — up tan Zaf[(—iu)®-! - 1] ifa <1, 
tpa Peri fost 


In the case œ > 1 an analogous formula is obtained with the help of (2.3.14). 
We do not give it because of its unwieldiness, but it is not difficult to reproduce 
after the remarks above. 

Formulas (2.6.5)—(2.6.9) serve as a source of many interesting analytic re- 
lations. However, very little has as yet been done in this direction, and the 
subject of integral transforms of stable distribution still awaits its enthusiasts. 

In the final analysis, the variable s in (2.6.5)-(2.6.9) is taken from some 
subset of the complex plane, but the initial computations should be done 
under the assumption that s is real, with passage to the complex values of s 
only in determining an explicit form of the transform or a simplified expression 
for it. 

The form of expression for a transformation ¥ of the functions f = lg can 
depend essentially on what representation is used for the density. In order 
not to make this section excessively longer, we do not consider all possible 
variants of the integral transform formulas (b)-(d) obtained by using the 
density representations in §2.2, but give preference to those formulas leading 
to the expressions that are simplest in our view. The form of the Fourier 
transforms of stable laws (i.e., the characteristic functions) is known, so we 
begin with the two-sided Laplace transformation. 

(b) For a density g = g(x, a, 8, y, A) (with the system of parameters in the 
form (B)) let 

A(s, a, B, Y, A) = (Ng)/(s). 


It is clear from the asymptotic formulas (2.4.8) and (2.5.4) that in the case 
B A +1 the density g(z,a, 3,7, A) decreases polynomially as z —> œo and as 


112 2. ANALYTIC PROPERTIES OF DISTRIBUTIONS IN G 


xz — —oo. Consequently, the two-sided Laplace transform of the density g 
exists only if s = —it, t € R, i.e., only if it coincides with the characteristic 
function corresponding to this density. For the same reason, it does not exist 
in the half-plane Res > 0 when 6 = —1. The transform A need not be 
considered in the half-plane Res < 0, since, according to Property 2.2, 


A(s, Q, p, Ņ, À) = A(—s, Q, =p, gi A), 
i.e., it suffices to confine oneself to analysis of the case Res > 0, p = 1. 


THEOREM 2.6.1. The following equality holds in the half-plane Res > 0 
(2-14) 


AW 
log A(s, œ, 1, y, À) = | 15 . Anat a á i (2.6.10) 

PROOF. As is clear from the asymptotic formulas (2.5.17) and Remark 3 
after Theorem 2.2.3, the density g(x, aœ, 1, y, À) decreases as r — —oo more 
rapidly than exp(cx) for any c > 0. Therefore, the transform A exists for any 
values of s in the half-plane Res > 0, and is analytic at interior points and 
continuous on the axis Res = 0. 

In this case the characteristic function g(t, a, 1) in §2.2 was given the special 
form (2.2.6) facilitating its analytic extension from the real t-axis to the half- 
plane Im z > 0. If we make the substitution z = ts, Res > 0, in this extension, 
we get the function A(s,a,1,7,A). The equality (2.6.10) follows from this. 

(c) The one-sided Laplace transform of the density g = g(x, a, 3,7, A), 
denoted by 


L(s, aœ, 8,9, à) = (Lg)(s), L(s,a,8) = L(s, a, 8,0, 1), 


exists in the half-plane Re s > 0 for any admissible values of the parameters. 
It suffices to consider only the case Res > 0, since, according to Corollary 
Bik, 

L(s, a, 8, y, A) = L(—s, a, —8, —7, À). 
As already mentioned above, it suffices to get all the formulas for the function 
L(s,a,3,7,) under the assumption that s > 0. They can be generalized to 
the case of complex s by analytic extension. According to the property 


g(x, a, B, q, A) = A7/%g(A~"/% (a — 1), a, B), 
where l = yA if a Æ 1 and l = yà + Bà log A if a = 1, the transform L can be 
written in the form 
L(s, a, B,y, A) = exp(—ls)L(A!/®s, a, 8) 


0 
+ exp(-ls) | exp(—sA!/%xr)g(a, a, 8) dz. 
=pA mia 
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Both terms can be simplified, and in a common way. Thus, to avoid repetition 
we confine ourselves to the analysis of the case l = 0. However, the conclusion 
of these arguments nevertheless leads to an expression for L(s,a, 3,7, A) in 
the case when a > 1. 


THEOREM 2.6.2. For any s in the half-plane Res > 0 the following ez- 
pressions are valid for the one-sided Laplace transforms L(s,a, 3) of standard 
stable distributions (%15): 

Ifa #1, then for any admissible 3 

sin Tp 


1 oo 
L(s, a, 3) = =f oe a re meteor a du. (2.6.11) 


Ifa=1 and 8 > 0, then 
sin Z (1 + B)u 


du. (2.6.12) 
S+U 


L(s,1, 8) = zf exp(—ßu log u) 
T Jo 
If a = 1, then for any admissible 8 


Lis, 1, 8} = -| [s cos( Bu log u) — u sin( 8u log u)] 
0 
y exp(—7u/2) 
Pie 
PROOF. For a ¥ 1 let us substitute in (Lg)(s) the modified expression 


(2.6.4) for the density and change the order of integration, which is allowed 
because the double integral converges absolutely. We have that 


du. (2.6.13.) 


is, 3) = : im f exp(—u%e~*7°/? + inp/2) du 
0 


oO 
«| exp(—sax — rue’™?/?) dz 
0 


= im f exp[—(ue~*7°/?)* + imp/2](s + ue'™/2)—! du. 
0 


Rotation of the contour of integration through the angle —7p/2 and a change 
of variable give us that 
1 = a €P +u 
L(s, Gh B) = 5 Im f exp(—u s+ ueimop du. 
Subsequent replacement of u by su brings us to (2.6.11). 

In the case œ = 1 the arguments are mostly the same. The difference reduces 
only to what expressions we use for the density g(x, 1, 3). To get (2.6.12) it is 
necessary to use (2.2.9), and to get (2.6.13) the density is taken in the form 
(2.2.1b). 
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We make some remarks about Theorem 2.6.2. 

REMARK 1. The rational function under the integral sign in (2.6.11) is 
equal to tgc¢(z, 1,0), as shown by a comparison with (2.3.5a). Since L(s, a, 3) 
= Lo(s,a,0), (2.6.11) can be given the form 


oO 


Lo(s, a, 6) =| exp(—(us)™)gc(u, 1,0) du. (2.6.14) 


If a < 1, then, by (2.2.6) and (2.2.32), 
exp(—s“) = (Lg(z, a, 1))(s) = L(s, a, 1). 


Substitution of this expression into (2.6.14) gives us 
(oe) oo 
Lo(s, a, 8) =i J exp(—suv)gc(v, a, 1)gc (u, 1,0) du dv 
o Jo 


=j exp(=sv) dv | go (2,0,1) ge(u, 1,4). 
0 0 u 


u 


Comparison of the left and right sides of this equality (as one-sided Laplace 
transforms) leads to the following relation between the densities of stable laws 
for the values a < 1 and z > 0 (%16); 


= £ du 
solza) = f go (a1) LE (26415) 


REMARK 2. The only case when L(s, a, 3,7, Aà), Res > 0, can be expressed 
in elementary functions corresponds to the values a < 1, 8 = 1, and y = 0. 
For such values of the parameters, g(x, a, 1,77, à) = 0 on the semi-axis x < 0, 
l.e., 
L(s,aœ, 1,7, A) = A(s,a,1,7,A) = exp(—A(ys + 8%)). 


REMARK 3. In the case a > 1 the transform L(s,a,1) can be expressed 
with the help of the Mittag-Leffler functions (see §2.10 for more details). 

REMARK 4. The transforms L(s,a,3,+7,) with any admissible values of 
the parameters have integral expressions analogous to those given in Theorem 
2.6.2. For example, in the case a > 1 

L(s, a, 8,7, A) = : im f exp{—A[(su)® — sy(ue’”? + 1)]} ai u 

A E n P q 1+ ue’™P 

and in the case a = 1, B>0 


L(s,1,8,4,A) = f exp{—A[8u logu — y(s + u) — s8 log Al} 
0 


sin 5 A(1 + 8)” 
SFU 


du. 
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In the case a < 1 the expression L(s, a, 3,7, ) turns out to be precisely 
the same as in the case a > 1, provided that ycosmp < 0. For ycosmp > 0 
the form of the expression for L(s, a, 3,7, à) changes, but the difference from 
the case a > 1 reduces only to a rotation of the contour of integration by an 
angle w such that ycos(7p — w) < 0. 

The transform (Lf)(s) of the functions f(r) = l(z)g4(r(zx),a, 8) are an- 
alyzed according to the indicated method with the use of the integral ex- 
pressions (2.2.12)—(2.2.15) for the functions g4(r(z),a,3). We do not give 
a general analysis of such transforms, but confine ourselves to two examples 
which are interesting in themselves and are good illustrations of the possibili- 
ties of the method as a whole. Both examples are connected with the densities 
g(x, 1, 8), 8 > 0. 


Consider on the semi-axis z > 0 the function 


f(z) = x~*ga(r(z), 1, 8), 


which, according to (2.2.13), has the integral expression 


1 (© ©) 
f(z)= = im f exp(—u — rT(u)) du 


(the functions r and T are the same as in the general formula (2.6.9)). We 
have 


(LF)\(a) = 1 tm f [i exp(—sz — u — tT (u)) du dz. 
n o Jo 
Since 5 i 
ReT(u) > -t ~ Bul log ul, 


the double integral converges absolutely, and we may change the order of 
integration. Consequently, 


J Da e (= - * Blogz, 1.3) dx 
0 


m 


= mf ks exp{—(s +T)r}du 
n fa 

1, f: = 

pe TH s+T(u) 


1 eo 
=+) f N 


In the second example we consider the function 


f(z) =x ‘g(—Blogz,1,8), B>O, 


=] 


2 
(s + au logu) +(1+ shë udu. 
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which, according to (2.2.19), can be written in the form 


1 
fej= = i Tile B)exp{—20i (2, )} de (2.6.16) 


m(1+ By) T l T 
Ui(p, B) = Zaro exp E (o+ *) tan zo . 


where 


Using (2.6.9), we get (changing the order of integration is obviously allowed 
in this case, because all the functions after the double integral sign are non- 
negative) that 


(LAs) = j ees -1 Whe, Tide 


= al. vle bdo f exp{—(s + Ui (p, 8))x} dz 


-5l j+ JA n dp. (2.6.17) 


The equality (2.6.17) has one interesting analytic corollary. Consider the 
case 3 = 1. According to (2.5.17), g(—logz,1,1) decreases as r — oo more 
rapidly than exp(—z/e). This means that the transform (£ f)(s), as a function 
of s, admits an analytic extension to the half-plane Res > —1/e, i.e., it is an 
analytic function in the disk |s| < 1/e. 

We expand both sides of (2.6.17) in series of powers of s and equate the 
coefficients of the same powers of s. This gives us the system of equalities 
(k=O, 1,005) 


J a*—19(—logz,1,1)d =i f ur (y, 1) dy, (2.6.18) 
0 


which are nontrivial for k > 1. A change of the variable in the integral on the 
left-hand side gives us that 


J e~*%9(x,1,1)d -1f u (p, 1) dy, ei M 


oOo 


The left-hand side of this equality coincides with A(k, 1,1) and, according 
to (2.6.10), is equal to exp(k log k) = k*. Consequently, for any integers k > 1 


af (SP) epee erm Fe} do = (38) 


(d) Consider the Mellin transform of the density g = g(x, a, 3,77, A). Let 
m(s, a, B, Y, À) = (Mg)(s). 
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For 0 < s < 1 we consider the function 
m(—s,a, B, y, À) = J, aoB, A)ade 
Using the inversion formula for the density, we find that 


AEE i= T / J ote g(t, a, B,y, A) di 
0 O 


T 


1 | 
— Re | alta, 8A) dt f tee “= dy 
n 0 0 


“r — s)Re f (it) 'a(t,0, 8,7, A) dt. (2.6.19) 
0 


Of course, these formal transformations of integrals require proofs, but we 
shall not give them here. On the one hand, they are very unwieldy, and, on 
the other hand, formula (2.6.19) will not be used below in so general a form. 
The solution of the same problem turns out to be of considerable interest for 
stable distributions in the class 9, since in this case it is possible to find an 
explicit expression for the function m(s, a, 3,7, A). 

We mention some easily verified facts needed in the course of the subsequent 
arguments. 

1) The transform m(s, a, 3,7, À) is finite for any s in the strip —1 < Res < 
a@ (a consequence of the finiteness of the density g and of the existence of 
moments of order less than a). 

2) Within the class W 


m(s, a, BhA) = A*/“m(s, a, 8,7, 1) 
= d8/"mo(s, a, 9) = \8/"mo(s, a, p) 

(a consequence of Property 2.2 and the definition of the class W; p = (1+6)/2 
is a modification of the parameter @). 

3) If a £1, then 

mc(s,a,p) = m(s,a, 8,0, 1) = m(s, a, 8) 

(see the connection between the parametrization systems (B) and (C)). 

4) The function mc (s, a, p) is jointly continuous in the parameters a and 


p in their domain of admissible values (this fact was mentioned in the Intro- 
duction). 


THEOREM 2.6.3. For any strictly stable law the following equality holds 
in the strip —1 < Res < a (217): 


_ sintpsT(1— s/a) 
es, B) = sints T(1—s) 


(2.6.20) 
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PROOF. Let a Æ 1 and let 3 have any admissible value. Let the expression 
(2.6.4) for the density be substituted in the integral expression for m(s, œ, /3). 
For values -1 < s < 0 


OO 
/ rolna 8) dx 
0 
ere) oo l . TT 
2 im f g dz f exp ( -zuee T ida =e) du. 
T 0 0 2 


It is not hard to verify that the conditions for changing the order of integration 
are satisfied here. We have that 


m(s,a, 8) = Smf exp ( - useriten/? + E) du 


f r° exp(—rue’™?/?) dz 
0 


m(s, a@, 3) 


du 


=T(1+s)- mf (uet"?/?)- ° exp(—u® GPT pe > 


Rotation of the contour of integration in the last integral through the angle 
Tp/2 gives us, after a corresponding change of variable, that 


m(s,a, 3) = Aue + s) sin rps | u`! exp(—u*%) du 
0 


1 S\ , 
= zal + s)P (—=) sin xps 


sin tps [(1 — s/a) 
sinats T(1—s) ` 


Both the left-hand side and the right-hand side of the last equality contain 
functions analytic in the strip —1 < Res < a. Therefore, the equality proved 
for —1 < s < 0 is preserved for values of s in the whole strip. Thus, (2.6.20) 
is proved in the case a # 1. The fact that this equality is preserved for 
a = 1 follows from the continuity property mentioned above for the function 
mo(s,a,p) ata =1. 

(e) The characteristic transformation will be the last of the integral trans- 
formations we consider. Since this transformation cannot be said to be well 
known, we devote somewhat more attention to it than to the others. First of 
all we acquaint ourselves with the main properties of characteristic transforms 
of densities px (x) of arbitrary random variables X: 


a4) | Wolt)x 0 E 
w(t) =( 0 eed = (Wpx)(t). 
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Although this definition assumes the existence of a density for the distribution 
of X, it is easily freed of this restriction by setting 


welt), = E|X|*(agn X)", (Ek, ks UL (2.6.21) 


The transform Wx (t) was first introduced in [101] and later found an appli- 
cation in multiplicative problems in number theory. Characteristic transforms 
of random variables play the same role in the scheme of multiplication of ran- 
dom variables as characteristic functions play in the scheme of summation. 
This analogue is not hard to see in the light of the following properties of 
characteristic transforms. 

1) The characteristic transform exists for any random variable X, as is 
clear from the very definition (2.6.21) of the functions wp(t)x. 

2) The distribution Fy is uniquely determined by the characteristic trans- 
formation Wx. 

Indeed, let 

e = P(X +0), e [= PAX < 0}, 
ctft =E|X|"1(X>0), f(t) = EJXIËI(X < 0), 


where f* and f~ are certain characteristic functions uniquely connected with 
the parts of the distribution F(z) on the semi-axes z > 0 and x < 0 when the 
corresponding coefficients ct and c~ are nonzero. We have that ct + c- = 
1 — P(X = 0), and 
we(t)x =e" fT (t)+(-1)*ef-(t),  k=0,1, 
cft (t) = 5(wo(t)x + wi(t)x), 
cF (t) = 3(wo(t)x — wi(t)x). (2.6.22) 
Consequently, the distribution Fy can really be recovered if we know the 
functions wo and w). 
3) If U and V are independent random variables, then, for their product 
A = UV, 
Wx(t) = Wu (t)Wy (t), tER. (2.6.23) 
We have 
wils EJUV |* (sgn UV )* = wk(t)uwg(t)v, k=0 1, 
which implies (2.6.23). 
4) For any sequence of random variables X ,X1,, X2,... the relation 
L( Fx, , Fx) +|P(X, = 0) —- P(X = 0)| +0 asn-o 
(where L denotes the Lévy metric) holds if and only if 
Wx, (t) -Wx(t) asn— co 


in any finite interval of values t. 
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This assertion is a simple consequence of (2.6.22) and known facts on weak 
convergence of distributions and the characteristic functions corresponding to 
them. 

Suppose that g = g(z,a,3,7,A) and s is a complex number in the strip 
—1 < Re <a. Let 


W (s, wh A) = (Wg)(—ts) 


_ | wo(s, a, 8, 7A) 0 
E 0 wi(s,a,8,7,A) J 


This function exists for any values of s in the indicated strip and any ad- 
missible values of the parameters, and is closely connected with the Mellin 
transform. Namely, 


wols, a, B, 7, à) = m(s, a, b, 9, A) + m(s, a, —B, —7, À), (2.6.24) 
w1(s, a, B,y,A) = m(s, a, 8, y, A) — m(s, a, =B, =Y, A). (2.6.25) 
Therefore, what was noted with regard to the transforms m remains valid also 
for the elements wg of the transform W. Theorem 2.6.3 enables us easily to 


obtain an explicit expression for the functions wz(s, a, B, Y, À) corresponding 
to the distributions in the class W (in the same form (C)). 


THEOREM 2.6.4. The characteristic transforms of the densities of strictly 
stable distributions have the following form in the strip —1 < Res < a (%18): 


cos 3 (k — 0s) T'(1 — s/a) 
cos#(k—s) [(1—s) | 


PROOF. On the basis of (2.6.20)—(2.6.22), 
wy (s, a, B, y, A) = à” "wk (s,a, 6,7,1), k=0,1, 


wg (s, a, 0, Ajo = A8/* k=0,1. (2.6.26) 


wo(s,a,9)c = mc(s, a, 0) + mc(s, a, —0) 
_ sing(1+ @)sT(1 — s/a) 


sin TS r(1- s) 
x sin (1 — @)sT(1 — s/a) 
sin TS (1 —-s) 


_ cos 54s T(1 — s/a) 
~ coszs T(1—s) ` 


In a completely analogous way, 


w1(s,a,0)c = mc(s, a, 0) — mc(s, a, —0) 
_ sin 50s T(1 — s/a) 
~ sings T(1—s) © 
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REMARK. In the general case neither the Melling transforms m(s, a, 3, y, À) 
nor the characteristic transforms W (s, œ, 8, y, à) have explicit expressions. 
This is because the cases of simple expressions for the transforms m and W 
are organically connected with the natural division of the densities of stable 
distributions into two parts. For example, the point z = 0 makes such a 
division in the class W (if we use the parametrization system (C)). 


§2.7. Unimodality of stable distributions. 
The form of the densities 


Among the diverse properties of probability measures regarded as worthy 
of attention in probability theory is one called unimodality. Encountering this 
property in the course of solving various problems usually gladdens statisti- 
cians, and does not leave probabilists indifferent. The property is enjoyed by 
many known distributions, including all stable distributions, as will be shown 
below. 

The term “unimodality” was apparently introduced in mathematical statis- 
tics by the light hand of Pearson. In statistics a “mode” of an absolutely 
continuous distribution is defined to be a point of the real axis at which the 
density attains, roughly speaking, a maximal value. Of course, a distribution 
can have not one but a whole set of modes. However, when statisticians say 
that a distribution is unimodal, they mean that it has precisely one mode 
(see [12]). It is not hard to see that the unimodality of a distribution, under- 
stood in this way, is inconvenient for its formal definition. The definition of 
unimodality commonly used at present was proposed in 1938 by Khintchine 
[44]. 

DEFINITION. A distribution function F(z) is said to be unimodal if there 
exists at least one value z = a such that F(z) is convex for z < a and concave 
for z > a. In this case one says that the distribution F(z) has a mode at 
the point z = a (or, more briefly, that z = a is a mode of F(z)). Here 
the convexity and concavity of F are understood in the broad sense, i.e., for 
example, F is convex on the semi-axis z < a if for any points 21,22 <a 


2F((z1 + £2)/2) < F(z1) + F(z2). 


It follows from the theory of convex functions (see, for example, [30]) that 
a unimodal distribution function F(z) with mode at a point x = a has the 
following properties: 

1) F(z) has left-sided and right-sided derivatives at each point z 4 a, and 
the one-sided derivatives coincide everywhere except for points forming a set 
which is at most countable. 
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2) The derivative F’(x) (where F” is chosen at each point as one of the one- 
sided derivatives) is nondecreasing on the semi-axis z < a and is nonincreasing 
on the semi-axis xz > a. 

Thus, the distribution F(z) is absolutely continuous. As will become clear 
below, the most convenient variant of the density F’(zr) is the one which is 
a left-continuous function (except perhaps for a single point). The class of 
distribution functions with such densities will be denoted by f. We need it 
in formulating general criteria for distributions to be unimodal. 

This definition of unimodality generalizes the original conception of the 
property of unimodality, since a distribution that is unimodal in the sense 
of this definition can have not one but a whole set of modes. This is shown 
by the simple example of the uniform distribution on the interval (0,1), for 
which each point of [0, 1] is obviously a mode. 

It is clear from the definition of unimodality that if points aj < a2 can be 
chosen as modes of a distribution F(x), then each point of [a1, a2] is also a 
mode of F(z). 

The problem of unimodality of stable laws stems from a 1936 paper of 
Wintner [87], where it is proved that all symmetric stable laws are unimodal. 
It should be mentioned that in his consideration of symmetric distributions he 
defined unimodality in the same way as Khintchine, by means of the property 
of convexity of the distribution function on the negative semi-axis. Instead of 
the term “unimodality” he used “convexity”. 

Wintner’s paper could serve as a good basis both for finding a natural 
definition of unimodality of distributions and for posing the general problem 
of unimodality of stable laws. It is known that this problem was discussed in 
1939 at a seminar led by Gnedenko at Moscow University. However, it turned 
out to be very complicated, and the complete solution of it was stretched over 
many years, involving in the process such a number of tragicomic situations as 
perhaps no other problem in probability theory can boast of. There is not yet 
an independent analytic proof of unimodality for stable laws. The property 
was proved in 1978 by Yamazato [94] as a corollary of a more general fact: 
the unimodality of laws in the class L (%19). 

The main content of this section is a presentation of Yamazato’s theorem 
in a somewhat simplified version connected with the subclass © of L. But to- 
gether with it we give proofs of particular results obtained earlier by Yamazato 
and by other methods. On the one hand, such information will be useful to 
anyone who wants to look for an independent analytic proof of the unimodal- 
ity of stable distributions. On the other hand, familiarity with the methods 
used may serve well in solving the more complex problem of unimodality for 
multidimensional stable laws. 
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We begin by giving several general criteria for unimodality of distributions. 
Consider first of all that properties 1) and 2), which follow from the definition 
of unimodality, can be regarded as necessary conditions. And conversely, by 
starting from properties 1) and 2) it is not hard to see that a distribution 
function F(z) having these properties is unimodal with mode at the point 
x =a. We thereby obtain the following criterion. 


THEOREM 2.7.1. A distribution function F(x) is unimodal with mode at 
the point x = a if and only if it has properties 1) and 2). 


For each distribution function F € ¥ we define on the real z-axis the 
left-continuous (except perhaps for a single point) function 


V(x) = F(z) — xF’ (z). 


THEOREM 2.7.2. A distribution function F € f is unimodal with mode 
at the point x = 0 if and only if the function V corresponding to it is a 
distribution function (?:?°). 


PROOF. Necessity. Consider the differential of V(x), understood as the 
value of the jump at points of discontinuity. Then at all points z +0 
dV (x) = dF (x) — F' (x) dx — xdF'(x) = -z dF' (zx). (2.7.1) 


Since F is unimodal with mode at z = 0, the right-hand side of (atl) is 
nonnegative. Consequently, V(x) is nondecreasing on the semi-axes z < 0 
and x >Q. 

If z > 0, then 


0 <2F"(2) <2 J F'(u) du = 2[F(2) — F (2/2) 
#42 
< 2min{1 — F(x/2), F(x) — F(+0)}, 
i.e., 2F"(r) + 0 as z +0 or z > oo. Analogous arguments in the case z < 0 
lead us to a conclusion which, combined with the treated case x > 0, can be 
given the form 


tF"(r) +0 if |z| +0, x £0, or |z| > o0. (2.7.2) 
Using the expression for V(x) and (2.7.2), we find that 
V(+0) = F(+0) > F(—0) = V(-0), 
1 — V (œ) = 1 — F(o0) = V(—o0) = F(—o0) = 0. 


Together with the property dV (x) > 0 (x # 0) established above, these 
equalities give us that V (zx) is a distribution function. 
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Sufficiency. Let F € h and suppose that the function V (x) associated with 
it is a distribution function. Then (2.7.2) obviously holds, and a particular 
case of it is the relation F’(r) — 0 as |z| — oo. Using this property and 
integrating the transformed equality (2.7.1) 


dF'(z) = -ŻdV (2), cAi 
from z to +00 sgn z, we find that 
+oo sgn T 1 
F(x) = | WV (v), ¢ #0. 


This makes it clear that F’(z) is nondecreasing on the semi-axis z < 0 and 
nonincreasing on the semi-axis x > 0, i.e., F(z) is unimodal with mode at 
z= 0. 

The following general criterion for unimodality of distributions is formu- 
lated in terms of random variables (recall that the symbol £ means equality 
of distributions) (72-1). 


THEOREM 2.7.3. A distribution Fx of a random variable X is unimodal 
with mode at the point x = 0 if and only if X can be represented in the form 


X£XZ, (2.7.3) 


where Xo and Z are independent random variables, with Xo uniformly dis- 
tributed on the interval (0, 1). 


REMARK. The condition that X can be represented as a product 
x2 x2 (2.7.4) 


of independent random variables, one of which (say X’) has a unimodal dis- 

tribution with mode at zero, is also a criterion for the unimodality of Fx. 
Indeed, the fact that (2.7.4) is a necessary condition follows from the ne- 

cessity of (2.7.3). Let us verify the sufficiency of (2.7.4). If X’ has a unimodal 


distribution, then X’ £ XZ", according to (2.7.3). Consequently, 
K2 yo! 2 neris. 


This and the sufficiency of (2.7.3) give us that Fx is unimodal with mode at 
zero. The criterion (2.7.4) is more attractive than (2.7.3) only as a sufficient 
condition for unimodality. 

PROOF OF THE THEOREM. According to the condition of Theorem 2.7.2, 
the distribution function Fy of X is unimodal with mode at zero if and only 
if there is a random variable Z such that Fz(z) = Fx(xr) — 2F{(z). 
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Let F(x) be unimodal with mode at z = 0. We compute the characteristic 
transform Wz(t). By (2.7.1), 


we(t)z = E|Z|” (sgn Z)* 
- J |z|**x(sgn x) dF% (z), k=0 i. 
r40 
Integration by parts and property (2.7.2) give us that 


oo J „lt een a) dF (2) 
z40 


= (1 + it) J |z|** (sgn x)* FX (x) dx 
= (1+ 2t)w,(t)x, We 0,1. 
Consequently, for allt € R 
Wz(t) = (1+ 1t)Wx(t). (ATD) 


Simple computations show that the characteristic transform of a random vari- 
able Xo uniformly distributed in (0,1) has the form 


Wx, (t) = (e id P A (2.7.6) 


and so it follows from (2.7.5) and (2.7.6) that 
Wx (t) = Wx (t)Wz(t). (2.7.7) 


The representation (2.7.1) is now obtained as a consequence of (2.7.7) and 
property 2 given at the end of the last equation for characteristic transforms. 

Assume the opposite, i.e., that X can be represented in the form (2.7.1). 
Then 


If dFz(x) is understood as the value of the jump of Fz(x) at points of 
discontinuity, then the last equality gives us by differentiation that 


+oo sgn x 1 
dF x(x) = J > dFz(u)dz. 


This means that F(x) is nonincreasing on the semi-axis x > 0 and nonde- 
creasing on the semi-axis x < 0, i.e., Fy is unimodal with mode at zero. 
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COROLLARY 1. The distribution Fx 1s unimodal with mode at the point 
xz = 0 if and only if the product (1 + 7it)W x(t) is a characteristic transform. 


COROLLARY 2. Suppose that p(x) is the density of some unimodal dis- 
tribution F with mode at the point x = 0. Then the distribution Fx with 
characteristic function f x(t) is unimodal with mode at zero if and only if 


ix(t) = | pluie (tu) du, (2.78) 


where fz(t) is a characteristic function. 


In particular, if F is the uniform distribution on the interval (0,1), then 
condition (2.7.8) has the form 


1 t 
ix(t) => | fa(u) du 
0 
(This criterion was found by Khintchine [26].) 


LEMMA 2.7.1. Let Fy, Fo,... be a sequence of unimodal distribution func- 
tions with modes at the respective points a,,a2,.... If Fa converges weakly to 
the distribution function F as n — oo, then F is unimodal with mode at the 
point a = limn_.o SUP An. 


PROOF. Unimodality of F,, at a, means that for any zı and 72 to the left 
of an 


2F,((21 + 22)/2) < Fn(z1) + Fy(2z2) 


and that the reverse inequality holds for points to the right of an. We choose 
a sequence of positive integers ną such that an, — a as k — oo. Then any 
two points zı and z2 to the left of a are to the left of the points an, for all 
sufficiently large k. Passage to the limit Fa, — F as k — oo shows that the 
limt function F has analogous properties of convexity and convexity, i.e., 


2F((z1 + r2)/2) < F(z) + F(zx2) 


for points to the left of a, and the opposity inequality for points to the right. 
But this means that F is unimodal with mode at a. 


LEMMA 2.7.2. If Fi and F> are symmetric unimodal distribution func- 


tions, then their convolution F = F; * Fz 1s also symmetric and unimodal 
(2-22), 


PROOF. Since the functions F}; are symmetric, i.e., 1 — F;(x) = F;(—zx) at 
all points of continuity, unimodality is equivalent to the inequality 


2F;((£1 + 22)/2) > F; (21) i F;(z2) 
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for any points zı > x2 > 0. We form the function 
Gr(y) = Fi (y) — żlFi (y +h) — Fı (y — h)). 
It is not hard to verify that 
Gry) = -Gr(-y),  Gnrly)sgny > 0, 


and that d|Fz(y — x) — F2(y + x)| > 0 on the semi-axis y > 0 for any z > 0. 
Let zı = z + h and z2 = z — h, z > h > 0. Then 


- Gn (y) d[F2(y — 2) — Fo(y +2)] > 0. 


Using the definition of the function Ga (y), we can transform this inequality 
to the form 


F((z1 + 22)/2) = [Fe + £2)/2 — y) dFo(y) 
> | SFile -) + Fı(z — v)] dF(u) 


= 5 [F(z1) + F(22)] 


Since a convolution of symmetric distributions is symmetric, this inequality 
means (as remarked above) that F is unimodal. 

REMARK. Without the condition of symmetry for the distributions F; the 
statement of the lemma is false, as shown by examples (the first such example 
was given by Chung [10]). Here is one of the simplest examples: 

Let pa(x) be the density of the uniform distribution on (0,a). For a < b 
let 

q(X) = €pa(z) + (1 — €)po(z). 
It is easy to see that this is the density of a unimodal distribution with mode 
at zero (each point in [0,a] is a mode of q). We consider the density 


w(x) = (q*q)(z) 
= "pa" (x) + 2e(1 — €)(Da * po)(x) + (1 — €)*pp*(z). 


The density pģ* (x) has the form of an isosceles triangle with base (0, 2a) and 
altitude 1/a. Consequently, 


E? (1 = €)? 2a (1 _ g) 
w(a) > m w(b) > a J w(x)dz > o 


Assume that w(x) > w(a) in the interval a < x < b. Then 


[oe dz > (b—a)w(a) > cal. —¢7, 
0 a 
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whence 


(=ef 


2b 2 
Eb 3 
J w(z)dr > ~~ -€ JE 9 


If now £ = 4,a = > and b = 1, then we see that the lower bound is greater 
than 1. This means that the assumption that w(x) > w(a) is not justified for 
this choice of parameters, i.e., between a and b the density w(x) takes values 


less than w(a) and w(b). The distribution for w(x) is thus not unimodal. 
THEOREM 2.7.4. Symmetric stable distributions are unimodal. 


PROOF. The case a = 2 corresponding to a normal distribution is obvious, 
and without loss of generality we can confine ourselves to an analysis of the 
case 0 <a < 2. 

According to Theorem C.1, the canonical form of the expression (1.17) for 
the characteristic function g(t, a,0,0, A) = exp(—A|t|%) of a symmetric stable 
law has the form 


log g(t, a, 0,0, A) = J (costz — 1)H' (x) dz, 
t#0 


where 
T 


2 


a 


H(z) =¢e|z|->-* = ave + a) sin -a|z 


We consider a sequence of series of mutually independent (in each series) 
random variables Xnk, 1 < k <n, n=1,2,..., having the symmetric distri- 
bution function 


Faz(2) =1- Fn (—2) 


A(z), tem 
nlH(-4)+H'(-2)@t+ a)  -—_<2<0. 


For ne > 1 
P(|Xnk| > €) = 2H(-e)/n -0 asn— oo 


uniformly with respect to k, i.e., the random variables X,,, satisfy the con- 
dition (I.2) of limiting uniform smallness; hence the limit distribution of the 
sum Xn = Xni+:::+Xnn can be determined by using Theorem B. For z < 0 


nF ng(z) = n{l — Fnk(—r)]| > H(z) asn—- ov. 
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Moreover, 
dn = nVa Xnj1(|Xnj| < €)) 
1/n 1 E 
= 2f a? H' (<) dz +2/ x? H'(x) dx 
0 n 1/n 
9 E 
MEN | +20 | a)—* dr 
3 n3 n 1/n 
_ “a2 2€ 2-0 a—2 
= cn + is mG y=) 
Consequently, 


he "O 
lim lim supo;, = 0. 
E—>0 n—- 00 


Therefore, on the basis of Theorem B, 
Falz) = P(Xn < x) — G(z, a, 0,0, 2) 


as n — OO. 

The distributions F,,, are symmetric and unimodal by construction. Con- 
sequently, by Lemma 2.7.2, their convolution F,,(r) is also symmetric and 
unimodal. To obtain the assertion it now remains to apply Lemma 2.7.1 to 
the sequence {Fp} of distributions. 

We now use a purely analytic method to prove unimodality of the extremal 
stable laws (i.e., those corresponding to the values 3 = +1). By Properties 2.1 
and 2.2, the proof of unimodality for the distribution functions G(z, a, 3, y, ) 
reduces to an analysis of the extremal standard distributions G(z,a,1). Fur- 
thermore, it is clear that the case œ = 2 need not be considered, and it can 
be assumed that 0 < a < 2. 


THEOREM 2.7.5 (7:23). Extremal stable distributions are unimodal and 
have exactly one mode. Moreover, in the case of a standard distribution 
G(z,a,1) the corresponding mode is positive if a < 1 and negative if 1 < 
a < 2. 


The proof is based on the representation (2.2.35) of the function g'(x, a, 8), 
0<a<2,8=+1,2>0. In the course of the proof we use some facts which 
we isolate as individual lemmas, with the notaiton in (2.2.35). 


LEMMA 2.7.3. Ifl1<a< 2, then g’(x,a,1) <0 for all z > 0. 


PROOF. According to (2.2.35), 


-r /a 


AENA IE ze | bexp(—k®“a) dy, (2.7.9) 


— TT 
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where the function 
sin p 


b(~) = (a — 1)~* Pare 


2/(e-1) / a sin(2 — ay aj 
sin ay 


is easily seen to be negative on the interval (—7,—7/a). Therefore, the inte- 
gral (2.7.9) is also negative. 


LEMMA 2.7.4. Ifl1<a< 2, then 
g’(0, a, 1) = —g'(0,a, —1) = AT(1+ 2/a) sin(27/a) < 0. 


PROOF. Using the representation (2.2.10) for the density g(z,a,), we 
find that 


1 00 | | 
g'(xz,a, B) = p- im f yexp(—y* — rye? + 2rip) dy. 
0 
This gives us that for 8 = +1 


g'(0, a, 8) = -5 sin anp | uexp(—u®) du 
0 


= 1 inor (=) = By (1 + z) Pn < 0. 
ant Qa 2T Q Q 


As is clear from its definition, the function b(p) has as factors a function 
that is trivially positive in the corresponding interval (u,v), and the function 
asin(2 — a)p 


sgn(1 — a) ( — -1) ifa #1, 


2pcoty — 1 if a = 1, 


w(p) = 


whose behavior is not obvious. 


LEMMA 2.7.5. On the corresponding interval (u,v) 

1) the function a(p) is positive and monotonically increasing, and 

2) the function w(p) is monotonically decreasing if a < 1 and monotoni- 
cally increasing if a > 1. 


PROOF. Consider the function h(a, p) = acot ay — cot p in the domain 
0< p< min(z,7/a),0<a< 2. In this domain 


L = (2sin? ay)~!(sin 2ayp — 2ap) < 0; 


therefore, for fixed p the function h(a, p) decreases as œ increases. Since 
h(1, p) =0, 
h(a,p)>0 if0<a<1l, 


ko peto ifl<a< 2. eeta 
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The case when a < 1 and 8 = —1, for which u = v, corresponds to the 
trivial situation (g(z,a,) = 0 on the semi-axis z > 0) and thus does not 
contradict the assertion of the lemma. 

In the case when a@ < 1 and 8 = 1 we have that u = 0 and v = 7, and 


ate sin ap \ VOÀ sin(1 — a)y 
~ \ sing sin ay 


asin(2—a)yp 


- — 1. 
sin ayo 


w(~) = 
(2.7.10) gives us that 
a'(p)/alp) = [ah(a, p) + (1— a)h(1 — a, 9)]/(1 — a) > 0. 


Since a(y) is obviously positive in (0,7), it follows that a'(p) > 0. 
Further, by (2.7.10), 


w'(~) = a[h(2 — a, p) — h(a, p)] < 0. 
If a > 1 and 6 = —1, then u = 0 and v = 7/a, and on this interval 


a(p)_ a 

ap) a-l 
by (2.7.10). On (0,7/a) the function w(y) = —1 + asin(2 — a)p/ sin ay has 
a derivative whose sign coincides with that of the function 


h(a, p) T h(a Ta 1,9) >0 


a|-h(a, p) + h(2 — a, p)| > 0. 


Suppose now that a > 1 and 6 = 1. In this case u = —r and v = —r/&, and 
sin p dees sin(1 — a)p 
a(o) = (- 22 mole 
sin ay sin ay 
piet _asin(2 - a)p i. 
sin ay 


The function a(y) is even. Therefore, it suffices for us to prove that it is 
decreasing on (1/a,7). We have that a’(~)/a(y) = —q(~)/(a — 1), where 
q(v) = a* cot ap — cot y — (a — 1)? cot(a — 1)y 
_ a sin? p + 2acos(a — 1)psin ysin(—ay) + sin? ay 
o sin(a — 1)psin psin(-ap) l 
The numerator of the fraction is positive on (7/a,7), and the denominator 


can be estimated from below by the quantity 


a? sin? p — 2asin ysin(—awp) + sin? ay = (asin p + sinay)?. 
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This implies that a(y) is decreasing on (1/a,7). The function w(y) is also 
even. Since 
w'(p) = a[-h(2 — a, p) + h(a, p)] < 0 


on (t/a, T), we get that w’(y) > 0 on (—7, n/a), which is what was required 
to prove. 

If a=1(u=0,v=7), then the sign of 8 does not influence the values of 
the functions a(p) and b(~). We have that 


a'(p)/alp) = 1/p — 2cot p + p/ sin? p 
= (p sin? p)! (sin p — pcos)? +> 0, 


w'(p) = 2cot p — 2p/ sin? p = (sin 2p — 2p)/ sin? p < 0. 


REMARK. By a direct substitution and a subsequent simple computation 
it is possible to verify that e(a)w(u) > 0 and e(a)w(v) < 0 in all cases 
considered (except for the trivial case a < 1, 8 = —1 and the case a > 1, 
3 = 1). This means that the function e(a)w(~), and with it e(a@)b(y~), changes 
sign precisely once from plus to minus at some point u < g < v. 

Let us proceed now to the proof of the theorem itself. If a < 1 and 8 = —1 
or a > 1 and 8 = 1, then g(z, a, 3) cannot have a maximum on the semi-axis 
xz > 0. In the first case this is obvious, and in the second it follows from 
Lemmas 2.7.3 and 2.7.4. We consider the cases a < 1, 8 = 1 anda > 1, 
8 = —1. On the semi-axis z > 0 the density g(z, a, 3) can have only an odd 
number of extremal points, and the number of minima is less by 1 than the 
number of maxima. This follows from the fact that g(0,a,1) = 0 in the first 
case, while g’(0,a,—1) > 0 in the second (by Lemma 2.7.4). 

Let zo > 0 be one of the extremal points of g(x, a, 3), i.e., g'(z0, a, 8) = 0. 
We show that g’’(zo, a, 3) is necessarily negative at this point. By (2.2.35), 
for z > 0 


g" (x, a, B) “KK | bexp(—ka) dip 


P Shetty f ab exp(—k%a) dy 


(Oe '(z,a, 8) + ŽK®+(-e(a)k') 


a(e(a)b) exp(—k“a) dp. 
J 
The first term vanishes at z = zo, while the second term, in which —¢(a)k’(z) 
> 0, can be estimated as follows for a # 1 and z > 0. By Lemma 2.7.5, in the 
cases we are considering the function a(y) > 0 is monotonically increasing on 
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(u,v), while €(a@)b(~) is monotonically decreasing and changes sign at a point 
u <ø <v. Consequently, 


v 


a a(eb) exp(—k*a) dp +f a(eb)exp(—k®“a) dp 


o 
v 


< a(o) [ ebexp(—k%a) dp + a(o) | ebexp(—k%a) dp 


= a(o) | ebexp(—k*a) dy. 


Hence, 


Vv 


g" (20,08) < 2h2+1(—e(a)k’) J e(a)bexp(—k%a) dio 


u 


= —ak%!k'g'(z0,@, 8) = 0 


at the point z = zo, which is what was required. 

Thus, g(z, a, 3) cannot have local minima on the semi-axis z > 0, but has 
a single maximum. 

Combining the established facts, we conclude that in the case a Æ 1, 3 = 
+1 the density g(x, a, 3) has exactly one maximum at the point z = m(a, 3) = 
—m(a,—); moreover, m(a,1) > 0 if a < 1 and m(a,1) <0 ifa>1. 

The case a = 1 admits similar analysis. However, it is not necessary to do 
this. The fact of the matter is that the property of unimodality for G(z, a, 3) 
is not violated under any linear transformation of the argument of G, hence 
not in passing from one parametrization form to another; for example, not 
in passing from Gg to Gm and back. Therefore, Gm(z,@, BM, YM, àM) = 
G(z,a,3m,7B,AB) is a unimodal distribution for any sets of parameters. 
Let us fix By, ym, and Ay and let a go to 1. Then, by Property 2.4, 


Gm(z,a, BM, JM, ÀM) — Gm(z, 1,8M, JM, àM). 


The distribution Gm (z, 1, 8M, YM, Àm ), as a limit of unimodal distributions, 
is itself unimodal for any Gy, ym, and Ay. An obvious consequence is the 
unimodality of the distributions G(z,1, 8), @ = +1. 

This conceptually interesting and purely analytic proof has not yet been 
extended to the whole family of stable laws. The proof of Yamazato’s theo- 
rem, to which we now come, is based on completely different considerations 
which turned out to be so general that in the final analysis they led to the 
establishment of unimodality for all laws in the class L. The proof reveals that 
the mainspring of the mechanism for forming the property of unimodality in 
infinitely divisible laws is a property of distributions discovered as far back as 
1956 by Ibragimov [33] and called by him strong unimodality. 
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A distribution function F(z) is said to be strongly unimodal if its compo- 
sition with any unimodal distribution is unimodal. It was established in [33] 
that F is strongly unimodal if and only if the function — log F’(z) is convex, 
and that the set of all strongly unimodal distributions is closed with respect 
to complete convergence. A normal distribution is an example of a strongly 
unimodal distribution. 


THEOREM 2.7.6. Each stable distribution is unimodal. 


We precede the proof of the theorem by several lemmas. Consider an 
infinitely divisible distribution F,,(z) with characteristic function 


fn(t) = exp { [te — Dula} , (2.7.11) 


where vn (zx) is a step function of the form 


Eite tên if0<2<cen"}, 
valt) = < Ep t---+bn ife(k—1)n—! <2 < ekn t, 
0 iés, 
which is determined by a set of positive numbers £1,...,&, with € = & + 
--+ + Ên > 4 and a positive number c. As t — oo 


lfn(t)| = exp »» Ek [ (cos £z — 1) ‘| 


k=1 

2 . ck \ dz mn _ 
< eie f cos (22) peig clost} < pP as 

=1 


hence the distribution F,, is absolutely continuous, and its density pn has a 
continuous and uniformly bounded derivative on the whole axis. Each function 
Fn is easily seen to be the weak limit as € — 0 of the generalized Poisson 
distributions FE with characteristic functions 


Ein = exo { [ce = Duala) : e>0. 


T 


And since each of the distributions F£(xz) is concentrated on the semi-axis 
x > 0, the same can be said of their limit distribution Fn. 


LEMMA 2.7.6. Any infinitely divisible distribution with finite expectation 
and characteristic function of the form 


it) = exp { f oo 1) dH(a) 
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where the spectral function H is unbounded and absolutely continuous, has a 
density p that is a solution of the integral equation 


p(x) = ie p(x — y)yH' (y) dy. (2.7.12) 


PROOF. The fact that a distribution with the indicated properties of H 
has a density follows from results in [100]. As is clear from [47], the existence 
of the expectation of this distribution is equivalent to the convergence of the 
integral f |x| dH (z). 

Let us consider the Fourier transform of the function zp(x). We have that 


f c= p(z) de = ti'l) = ~af(t) - Ž / (e2 — 1)H' de 
0) e"*sH (xr) dx 


= [ew dz | p(x — y)yH'(y) dy. 
y#0 


Comparison of the Fourier transforms at the beginning and the end of the 
chain of equalities gives us (2.7.12). 

REMARK. The distribution F, obviously satisfies the conditions of the 
lemma, and thus the corresponding density pn is a solution of (2.7.12) with 
the function H'(x) = v,(x)/x. Differentiating this equality, we get after a 
simple transformation 


xp,,(z) = (E — 1)pn (1 D? EkPn (z < +), (2.7.13) 


LEMMA 2.7.7. The distribution F, is aoad with mode at the point 
Ma > ejn. 


PROOF. Since pp(z) = 0 for z < 0, (2.7.13) has the form zp! (z) = 


(€ — 1)pn(x) on the interval 0 < x < c/n, and this implies that 
Pn(t)=cyt8*, = O< a <e/n. (2.7.14) 


It follows from (2.7.13) that pn(z) > 0 as z — oo. Together with (2.7.14), 
this means that p,(z) has at least one relative maximum on the semi-axis 
z > c/n. Denote by zo the infimum of the set A of all relative maxima on the 
semi-axis z > c/n. Two situations are possible. 

Case 1. The point zo is an isolated point of A, or Zo is itself a relative 
maximum. Assume that p,,(z) is not nonincreasing on the semi-axis x > Zo. 
Then on this semi-axis there is at least one relative minimum of pp (£). Let z1 
stand for the infimum of all such relative minima. Obviously, xı > zo > c/n 


and p,,(1) = ph (z0) = 0. 


136 2. ANALYTIC PROPERTIES OF DISTRIBUTIONS IN G 


Consequently, we can make the following assertions: 

1°. pn(z) is strictly increasing on (0, zo). 

ou he = 1) pn (Zo) — De EkPn(Zo — ck/n) = 0. 

3°. Palt) is PA decreasing on (Zo, 21). 

4°. (E —1)pn(21) — oper EkPn(21 — ck/n) = 

If zı —c > Zo, then zı — ck/n > Zo for all ae k < n. From this and 
from properties 3° and 4° we get 


0 = (£ —1)pn(z1) -D earn zı — ck/n) 


= D Ek(Pn(z1) = Pn(21 z ck/n)) Prn(T1) <0, 


which is impossible. 

Suppose now that zı —c < zo. This means that there is a j < n such 
that 2; — ck/n > zo if k < j and zı — ck/n < zo if 7 < k < n. In this case 
properties 3° and 4° lead to the inequality 

IIET ACTI = èlf + 1)/n) aie 5 Fab l(®i = c) 
< (j1 e k a En E 1) Pal ti) 
Since zo — ck/n < 21 —ck/n < Zo for 7 < k < n, it follows from property 1° 
that 
Pn(Xo — ck/n) < pn(zı — ck/n), (La 
Together with the inequality p,(zo — ck/n) < pn(zo), 1 < k < j (which also 
follows from 1°), the preceding two inequalities give us that 


n 
(€—1)pn(to) > X` E&kpn(zo — ck/n), 
k=1 
which contradicts 2°. 

Case 2. Suppose that zo is a limit point of the set A but is not in A. In 
this case it is possible to choose a relative maximum point 25 > Zo and a 
larger relative minimum point zı > zg such that zı — c/n < zo. Since p,(z) 
is strictly increasing on (0, zo), it follows that p,(xzj — ck/n) < pn(z1ı — ck/n) 
for 1 < k < n. Moreover, p(z) = ph(z1) = 0 and palz) > pn(z1). We 
have 


(€ — 1)pn(z1) < (E — 1)pn(z0) 


= => ExPn (xg — ck/n) < - EkPn(Z1 — ck/n), 


k=1 
which contradicts acu ie Consequently, the distribution F» has a unique 
mode at the point Mn = To > c/n. 


§2.7. UNIMODALITY 137 


LEMMA 2.7.8. The density p,(x) is logarithmically concave on the inter- 
val (0, Mn). 

PROOF. Recall that the condition € > 4 occurs in the definition of F,,. It 
follows from the continuity of p},(x) on the whole axis and equation (2.7.13) 
that p} (x) is continuous on the semi-axis x > 0. Differenitating both sides of 
(2.7.13), we get that 


xpi (x) = (€ — 2)pn (z yee 


k=1 
From this and (2.7.13) we obtain 


rB(x) = 2[(p,,(2))* — pn(x)pn(2)| 
= Pn(x)pn(z) + $ Exlpn(z)p', (a — ck/n) 


n=1 
= Pp(2)Pr(x—ck/n)}. (2.7.15) 
It is not hard to see that B(x) is continuous on the semi-axis z > 0 and 
positive in the interval 0 < z < c/n. We show that B(x) > 0 also for 
c/n < £ < mn. Indeed, suppose not, and let rą, = min{z: B(x) = 0, £ > c/n}. 
Then B(z) > 0 for 0 < £ < gx, pi(z.) > 0, and c/n < Za < mn. Consider 
the following two possible cases. 
1*. For some k with 1<k<n, 


A = Pn (24 )Pn (24 — ck/n) — py (2x) Pn(z. — ck/n) < 0 
2*. A > 0 for any k with l<k<n. 
In the first case ck/n < x, and pn(£, — ck/n) > 0. Consequently, 
Dn(t. —ck/n) _ Prlz.) 
Pn(t.—ck/n) ` Pnz.) 
This means that there is a point z’ between z, — ck/n and z, such that 
(ph /Pn) (x) > 0, which is equivalent to B(x’) < 0. However, this contradicts 


the choice of z,. 
In the second case, as is clear from (2.7.15), 


Pn(2x)p' (T, — ck/n) — py (T+)Pn(t« — ck/n) = 0 

for all k. Since c/n < z., it follows that p,(z. — c/n) > 0, and hence 

Pr(t-— c/n) _ Prlz) 

Pn (T gy c/n) Pn (Ta) 
Hence there exists a point z’ in the interval (£, —c/n, x.) at which (p! /pn)(z') 
= 0, i.e., B(x’) = 0, which again contradicts the choice of z,. It has been 
proved that B(x) > 0 in the interval 0 < £ < my, and this is clearly equivalent 
to the assertion of the lemma. 
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LEMMA 2.7.9. Suppose that absolutely continuous distributions with den- 
sities p(x) and q(x) have the following properties: 

a) p'(x) and q'(x) exist and are continuous on the whole azis. 

b) p(z) = 0 for x > 0 and q(x) = 0 for z < 0. 

c) p(x) and q(x) are unimodel with modes at the respective points —a < 0 
and b > 0. 

d) p(x) and q(x) are logarithmically concave on the respective intervals 
(—a,0) and (0,6). 

Then the convolution of these distributions is unimodal. 


PROOF. Two cases are possible: a < b or a > b. The symmetric nature of 
the conditions tells us that we can confine ourselves to an analysis of one of 
these cases, say the first. Consider the convolution density 


s(x) = | "pe-a J ” gla —y)ply) dy. 


=x 
The density s(x) also has a continuous and also bounded derivative 


T 


s'(x) = J j p(z — y)q'(y) dy = / q(x — y)p'(y) dy. (2.7.16) 


— 00 
It is clear that the derivative is of constant sign on the semi-axes (—00,a) and 
(b, co); therefore, it suffices to prove uniqueness for the maximum of s(x) for 
—a < qx < b. We can assert the following. 

1°. If s'(x) < 0 for some z € (0,b — a), then s'(y) < 0 for all y € (z,a — b). 

2°. If s'(x) > 0 for some z € (—a,b — a), then 3'(y) > 0 for all y € (—a, z). 

Indeed, suppose that 0 < € < a, and let 

_ [a(z +e)/al(x) ifq(z) > 0, 
A= | 0 if q(x) = 0. 

The function A(x) is continuous, because q(x) is continuous. Since q(x) is 
logarithmically concave on (0, b), A(x) is nonincreasing on (0,b— €). Further, 
since q(x) is nondecreasing on (0, b) and nonincreasing on (b, o0), A-(z) is also 
nonincreasing on (b — £€,a). Therefore, A(x) < 1 for x > b, and A(x) > 1 
for r < b—e. 

Suppose that the point zo € (0,b — a) is such that s’(r9) < 0. By (2.7.16), 


(2 +2) = | diada Pilon ehcp 


0 
+ i p'(y)Ae(zo — y)a(zo — y) dy. (2.7.17) 
Since A, is continuous and nonnegative, the function p’(z) has constant sign 
on each of the intervals (—oo, a) and (—a, 0), and the function p’(y)q(zo —y) is 
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integrable, it follows from (2.7.17) and the integral mean value theorem that 


a 


Ce ed a J dy alan -ody 


0 
+ Acting +70) f p'(u)a(zo - y)dy, (2.7.18) 


-Q 
where To < to +n2 < To +a < band zo +a < zo +n. If we choose £ > 0 such 
that b— a — zo > 0, then zo +m < b— € and, consequently, Ae (zo + 72) > 1. 
If ro + m1 < a, then, since A(x) is nonincreasing on (0, b), 


Ae(To + 2) > Ae(zo + m1). 
But if to + nı > a, then 
Ae(To = n2) 1 A; (zo + nı). 


It follows from (2.7.16) and (2.7.18) that s'(zo +€) < 0 for 0 < £ < b-a — to, 
i.e., assertion 1° is valid. 

Further, let zo E€ (—a,b — a) be such that s'(zoọ) > 0. By (2.7.16), for all 
E<2o + @ 


=@ 


s'(zo — €) = [Ae(zo — € + m1)? f p'(y)q(zTo — y) dy 


+ [Ae (T0 — € + n2) 7} / p'(y)a(xo — y) dy, 


=Q 
where 0 < To — n2 — E€ < To +a — € < To + nı — € and to +d- E <b-—e. By 
the same arguments as above, this gives us the inequality s'(zọ — £) > 0 for 
0 < € < To +a, and with it the confirmation of 2°. 

Note that the condition b — a > 0 was not used in the course of the proof. 
Hence, by the symmetry of the condition in 2°, we can get the following 
assertion from it as a corollary. 

3°. If s'(x) < 0 for some z € (b — a,b), then s’(y) < 0 for any y € (x,b). 

The condition that s(x) is unimodal is now obtained as a simple conse- 
quence of properties 1°-3° (under the assumption that b — a > 0). 

We now proceed to the proof of Theorem 2.7.6 itself. It is clear that we can 
confine ourselves to standard distributions G(z, œ, 8) without loss of gener- 
ality. According to (2.2.5), the characteristic functions of these distributions 
have the form 


i or we dx 
log g(t, a, 8) = ita + aC; J (e7 — 1 — itsin r)a 
0 


kica dx 
+ acs | (e? — 1 — itsin 2e i 


— 00 
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when a # 2. The functions 2~% (x > 0) and |z|% (x < 0) can obviously be 
approximated by functions of type Vn in such a way that the corresponding 
sequences of distributions F4 (x) (x > 0) and F; (x) (z < 0) have the following 
property for any t when the constants a, are suitably chosen: 


co 0o d 
itan +f (et? — Nut (2) +f (e7 — 1)v= (x)= 
0 T =00 n 
— log g(t,a,B) asn —> œ. (2.7.19) 


If n is sufficiently large, then the distribution F,*(z) and the distribution 
1— F7 (—z) fall under Lemmas 2.7.7 and 2.7.8, and, consequently, F* and F; 
satisfy the conditions of Lemma 2.7.9. Therefore, the distribution Ft +F; ì8 
unimodal for any sufficiently large n. This and (2.7.19) tell us that the limit 
distribution G(z, a, 3) is also unimodal. 

REMARK. The method of proof does not allow us to see where the mode of 
G(x, a, 3) is. This deficiency of Theorem 2.7.6 (in comparison with Theorem 
2.7.5) can be removed in the case a # 1 by employing the same argument 
used in the proof of Theorem 2.7.5. Namely, for any admissible values a # 1 
and 2, 

g'(0, a, 8) = +T (1+ 2/a) sin[t8K(a)/al. 


Since the mode m(a, 8) of the distribution G(x, a, 3) is unique, this implies 
that in the case a Æ 1 


sgn m(a, 3) = sgn(1 — a) sgn £. (2.7.20) 


It is easy to derive a connection between the mode m(a,{,7,A) of 
G(x, a, B,7,) and the mode m(a, 8) of the corresponding standard distri- 
bution from the relation (2.1.2) by using the formulas (I.18) and (1.19) for 
passing from the form (A) to the form (B). Thus, in the case a # 1 


m(a, B, y, A) = (Acos (28K (a)))"/" m(a, B) + 7A. 


This equality, together with (2.7.20), leads to the following conclusion about 
the location of the mode m(a, 3, y, À) on the real axis in the case when a # 1 
and 3 > 0. Namely, 


mla, b, Y, ÀA) > 7A _ ifa <1, 


, (2.7.21) 
MABA) < JA a> I. 


In particular, the mode of the unbiased distribution (i.e., the distribution 
having parameter y = 0) in the case a Æ 1 is always on the same semi-axis as 
the mode of the corresponding standard law. This conclusion, as well as the 
inequality (2.7.21), remains valid also for the form (A). 
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TABLE OF VALUES OF THE FUNCTION g'(0, 1, 3) 
B g’(0,1, 8) 


REMARK. The errors in these values of g’(0, 1, 3) do not exceed 0.0005 
in absolute value. The computations were carried out on the basis of 
the following representation of this function: 


1 1 
g'(0, 1, 8) = 282 L, U(U — 1)(exp(—U) dy, 


where (1+ Be) i 
_ 7 tpp = 
=a exp { 5 te + 1/8) tan(me/2) | 
Unfortunately, the case a = 1, 8 # 0 cannot be cleared up so easily because 

of the lack of a sufficiently simple expression for g'(a, 1, 3), which would enable 
us to establish the sign of the derivative at least for the one point a of the real 
axis. At the same time, computer computations of the values of g’(0, 1, 3) for 
0 < 8 < 1 (see the table above) show that g’(0,1,) < 0 and, consequently, 
m(1, 3) < 0 for 8 > 0. From this, the fact that 


m(1, 8, y, à) = $Am(1, B) + yA + BAlog (FA) 
(see (2.1.2) and (I.19)), and the fact that, according to (2.1.3), 
m(a, —8,—y, A) = —m(a, 8,74, A) 
for all admissible parameter values, we get the estimates 
m(1, 8, Y, À) < yA + BAlog(mA/2) if B >, 
m(1,8,7,A) > yA + BAlog(mA/2)_ if B <0. 
A number of interesting observations about the modes m = m(a, 33,74, \) 


of stable laws are contained in recent papers of Ken-iti Sato* dealing with the 
study of properties of modes. 


*The author has at his disposal only preprints of these papers (published by Nagoya 
University) under the titles: 

1) Bounds of modes and unimodal processes with independent increments; 

2) Behavior of modes of a class of processes with independent increments. 
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(i) The mode m, as a function of À, is either convex or concave on the 
semi-axis 4 > 0. More precisely, the sign of the second derivative of m with 
respect to A coincides with the sign of ĝ, i.e., 


sgn m” (a, B, y, À) = sgn B. 
The form of the first derivative m’ with respect to A enables us to conclude 
that, as À varies from 0 to oo, the mode m/(a, B, y, A) changes sign (moreover, 
only once) always in the case a = 1, and if and only if y8(1 — a) < 0 in the 
case a Æ 1. 


(ii) It is possible to determine precise bounds for the variation of the mode 
m(1, 3), with account taken of the fact that m(1, —8) = —m(1, 8). Namely, 


Im(1, 8)| < ApB3/?(2p-! + (1 — p)~* + 4n—76?(2 — p)-*)*/?, 
where 0 < p< 1, 
By = 2r(1 + p)sin $p, 
and the quantity Ap, which is nonincreasing with p (it does not have a suffi- 


ciently simple explicit expression) can be estimated above by the quantity 


A* = inf{Aqp:0 < q < min(1,p)}, 


— fat1+(p+ 1)? ue 
= (q+1-—(p+1)1/P 

Some additional information about localization of the mode m(a, 8) can 
be extracted from results in [71] (see Theorems 4.1 and 6.1). Let va = 
|1 -['(1+ a)| sin ra/2. 

IfO< a < 1, then 


Ag.p 


m(a,—-1) < m(a, B) < m(a, 1), 
with the inequalities strict for |8| 4 1, and 
(ir(1-a)) ™* < m(a,1) +a < (2F(2-a))/*. 
If 1 <a< 2, then 
m(a,1) < m(a, B) < m(a, —1), 
with the inequalities strict for |8| # 1, and 
L((a/P(—a))!/*) < m(a,1) + va < T(1+ a) sin Fa, 


where 


1 u2—% du u-“ du 


(Ë = — en Ses eee imam g Tar; l-—a 
(z) z+ 1+u? J l +u? ` 
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We remark further that the cases a 4 2, 8 = 0 and a = 2 corresponding 
to stable distributions symmetric with respect to the line z = yA are the only 
ones so far in which the exact value of the mode is known: m(a,0, A) = À, 

Summarizing the above facts, we can form the following general idea about 
the behavior of the densities of stable laws. 

1. The graphs of the densities 

g(x, a, B,y, r) and g(x, a, —B, +, A) 
are symmetric with respect to the line z = A, while the graphs of the densities 
g(z,a,8,y,) and g(z,a,—6,7, d) 
are symmetric with respect to the ordinate axis. In particular, if 8 = 0. 
then the graph of g(x, a,0,~, A) is symmetric with respect to the line z = yA. 
Considering that the behavior of the density in the case a = 2 is clear without 
any special investigation, these properties allow us to confine ourselves to the 
case 0 < a < 2, 8 > 0, y > 0 when analyzing the behavior of the graphs of 
the densities. 

2. If 0 <a <1 and ĝ = 1, then g(z,a,1,¥,) is nonzero only at points 
of the semi-axis z > yA. In all other cases g(x, œ, œ, y, À) is nonzero at each 
point of the real axis. 

3. The graphs of the densities g(z,a,3,7,) are unimodal and have a 
unique mode, which is located to the right of the point yA if 0 < a < 1 and 
p > 0, and to the left of yA if 1 < a < 2. 

4. If-1< 8 <1, then g(x, a, B, y, À) decreases like const z~!~-® as t — ov. 
But if 6 = —1, then g(x, a, 3,7, A) decreases exponentially in the case a < 1 
as x — y\ — 0 and in the case a > 1 as x — oo. More precisely, 


log g(z, Q, B, > x) ~ — const Elz, a) 


(the function € was defined in §2.5). 

We supplement the facts proved above by some more properties (proved in 
[40]) of densities of stable laws. 

5. If a < 1, then for any positive Ay # àz the graphs of the densities 
g(x, a, 8,y,A1) and g(x, aœ, 1,0, A2) intersect only once on the semi-axis x > 0 
where they are concentrated. 

6. For any 0 < a < 2 and any positive A; Æ A» the graphs of the densities 
g(x, a, 0,0, A1) and g(z, a, 0,0, Az) intersect at only two points (symmetrically 
located with respect to the point x = 0, of course). 

The graphs of densities of stable laws given in Figures 2-4 illustrate the 
properties listed above.* 


*The graphs in Figures 2-4 are taken from graphs of densities of stable laws contained 
in the survey article [31] by Holt and Crow. These figures are reproduced here with the 
permission of the authors of [31]. 
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FIGURE 2. Graphs of densities g4(z, a, 3) corresponding to 
the values a = 0.25 and a = 0.5. 
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FIGURE 3. Graphs of densities g4(zx, a, 3) corresponding to 
the values a = 0.75 and a = 1. 
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FIGURE 4. Graphs of densities g4(z, a, 3) corresponding to 
the values a = 1.25, a = 1.5, and a = 1.75. 


The paper [40] cited contains another observation of interest to us, based 
on the following fact (which is proved there). 


7. Suppose that X is a positive random variable with absolutely continuous 
distribution. Then the following assertions are equivalent: 


a) The distribution of the random variable log X is unimodal. 
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b) There exists at least one version p(x) of the density of X (one version 
differs from another at most on a set of Lebesgue measure zero) such that for 
any c > 0 the graphs of the densities cp(cx) and p(x) intersect at precisely one 
point on the semi-azis x > 0. 

This criterion and property 5 obviously imply that in the case 0 < a < 1 
the density 

exp(z)g(exp(z), a, 1), -0 < T<, 
of the random variable log Y (œ, 1,0, 1) is unimodal. 

The recent article [247] studies a certain property of the density of a sta- 
ble law which refines the concept of unimodality, and the following fact is 
discovered. 

8. The density p(x) of a distribution will be said to be bell-shaped if p(x) 
has infinitely many derivatives at all points of the real axis, and for any integer 
k the derivative p‘*) (x) has exactly k zeros on the support of the distribution. 

Then the densities of all stable laws are bell-shaped in this sense. What 
is more, the number of positive zeros and the number of negative zeros of 
g‘*) (x, a, 3) can be computed in the case a £ 1. 


§2.8. Stable distributions as solutions of integral, 
integrodifferential, and differential equations 


In the Introduction we gave a canonical form of expression for the charac- 
teristic function g(t, a, 8) (valid both for the form (A) and for the form (B)), 
and we shall transform it somewhat: 


g(t) = g(t, a, 8) = exp th 
= exp |- [Le — 1 — itsin RaZ) , (2.8.1) 


where R(x) = —rH'(zx), x # 0, is a function that is nondecreasing on the 

semi-axes x < 0 and z > 0. It has the simplest form if the parametrization is 
taken in the form (A): 

R(z)= 4 2 iT(1 + a)sin(ra/2)(1 + )z7%, z>0, 

a +T(1+a)sin(ra/2)(1—)|z|-*%, «<0. 

The function R(x) corresponding to the form (B) is obtained by replacing 

the parameter 6 = (Gy, in (2.8.2) by its expression in terms of a and Gp. 


Below, let = 
ee f Ear 
m 


(e7 — 1 —itsinz) dH(a)| 


(2.8.2) 


be the integral sine function. 
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THEOREM 2.8.1. For either of the two forms of parametrization (A) or 
(B) the distribution function G(x, a, 3) and the density g(x,a, 3) satisfy the 
equations 


ee ae J (G'(z,a, B) Si(y) 


y#0 
— G(z — y,a, 3) + G(z,a,3))dR(y), (2.8.3) 


zga, p) = f (oeat - olz- 0,6) R(y)dy. (2.8.4) 


y#0 
PROOF. We transform (2.8.1), integrating the integral in the exponential 
by parts and considering that 


J (ew —1 - itsin) Z =O(u?) asu—>0, 
0 


i (e*’ — 1 —itsin TE = O(logu) asu— oœ. 
1<|v|<u E 


As a result, 


pec l J p J Mee o it Situ) dR(y)} l (2.8.5) 


Differentiating both sides of (2.8.5) with respect to t £ 0, we find that 


a'(t) = Fal) f (e — 1 — it Silu) dR). 


From this, after the change of variables t = s/z, z # 0, we get 

d 8 1 (/s 18 

Ba A Pde ea val tesy/z_ 1 _ “as 

Zo ( =) =g ( =) / (e 1-5 si(v)) dR(y). (2.8.6) 
We transform the inversion formula 

= 1 —7tz 
gleo) = zi fegiyat, 240, 
by carrying out the change of variable tz = s: 
_ SEnr —13 = 
rg(x, a, 3) iE. on fe g (=) ds. 


Differentiating both sides of this equality with respect to z and replacing 
(d/dx)g(s/x) by its expression (2.8.6), we get that 


(xg(zx,a, 3))’ = -r feos (=) =| G -1—  si(y)) dR(y). 


T 
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A change in the order of integration and the inverse change of variable s = tr 
leads to the equality 


(xg(z, a, p) = -z J arty) [cer —1—itSi(y))e~* g(t) dt. 


The inside integral is a linear combination of integrals, each of which is none 
other than the inversion formula for the density or its derivative, i.e., 


(zg(z, 0, B)! = - / gir- wa ~ gla;o, 8) 


+ g'(x, a, B) Si(y)) dR(y). 


The equality (2.8.3) is obtained by integration of both sides of the equality 
with respect to x from —oo to x and use of the fact that zg(x, a, 8) — 0 as 
xz — —oo. Integration by parts in (2.8.3) leads us to (2.8.4). 

REMARK 1. Although Theorem 2.8.1 is connected with stable distributions 
normalized by the conditions y = 0 and à = 1, the assertion of the theorem 
extends easily to the general case. Indeed, since 


Ge a, 9,7, 4) = G(x — DA", a, 8,0, 1) 


by (2.1.2), where the quantity l is connected with 3, y, and À by simple 
formulas, we get the equations for G(z,a,3,7,) and g(z,a,3,7,) from 
(2.8.3) and (2.8.4) by replacing z by (x — 1)A~}/@, 

REMARK 2. Note that in the course of the proof we obtained an integro- 
differential equation (2.8.7) for the densities of stable laws. 

Integral and integrodifferential equations for stable distributions can differ 
(sometimes very strongly) in form, though they are all equivalent in the final 
analysis. The integrodifferential equation (for densities with a 4 1) obtained 
in [60] is apparently the closest to equation (2.8.7), which has taken from [40] 
along with (2.8.3) and (2.8.4). 

Below we shall meet a certain integrodifferential equation differing strongly 
from those given in Theorem 2.8.1. A particular merit of it is that differential 
equations for densities in the case of rational a Æ 1 can be derived from it 
in a fairly simple way. To formulate this equation we must (following [59]) 
introduce the concept of fractional integration and differentiation. 

The fractional integral of order r > 0 of a function h(t) on the semi-axis 
t > 0 is defined to be the function 

ear 


To (t — x)"—!h(t) dt 


(of course, h(t) must be such that the integral on the right-hand side exists). 
If r is an integer, then the operator J” coincides to within a sign with the 


(2.8.7) 


Pilz) = 
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operator of r-fold integration, i.e., 


I"h(x) = — J I™-*h(t) dt. 


Here I? is understood to be the identity operator (J°h(x) = h(z)). 
The operator I~" of fractional differentiation of order r > 0 is understood 
as the inverse of the operator I”, i.e., 


I" (I-Th(z)) = h(z). 


If r is an integer, then J~" = (d/dx)~". In the case when r > 0 is not an 
integer the operator I7”, like J”, can be given an integral form. Namely, if 
0<r< 1, then 


amr 
e 


Ih) = A J ” h(t) — h(z)](t — 2)-7-? dt. 


But if r > 1, then 

I~" h(x) = I-*(I-°-™ h(z)), 
where n is the integer part of r. The set {I", —oo < r < oo} of operators thus 
defined forms an abelian group (7:74). 

The operators J” can be used to write a number of equations for the densi- 
ties of stable distributions in the class W. In the functions y(x) = zrg(z, a, 0), 
x > 0, considered below, the densities g(z,a,@) are parametrized in the form 
(C) (but we omit the index C). 

We mention several special properties of the operators J” that are needed 
in what follows. 


LEMMA 2.8.1. Letz be a complex number. If r > 0 and Rez > 0, or if 
r <0 and Rez > 0, then 


I” exp(—zzr) = exp(—inr — zxr)z7". (2.8.8) 
In particular, for r > 0 and z = ~it, t > 0, 
I~ exp(ttz) = exp(imr/2 + itz)t’. (2.8.9) 


Let s be a real number, and n the smallest integer such thatn+s>0. Then 
for anyr<s 


s(s + 1)---(s+n—1)I(s—7r) 


Ir~ ° = exp(—inr) T(s +n) ge, (2.8.10) 
In particular, if s > 0, then for any r < s 
ah 
Fr” = exp(itr) Ñ a (2.8.11) 


Fia) 
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PROOF. The case r = 0 clearly does not need to be analyzed. If r > 0 and 
Rez > 0, then 


I" exp(—zz) = eo | e~**(t — x)! at 


exp(—irr) —22 f —2tyr—1 
= —_——e en ee, 
P(r) 0 


It is not hard to see that (2.8.8) follows from this. 
If —1 < r < 0 and Rez > 0, integration by parts given us that 
exp(—z7r) 


I" exp(—zz) = Tin) fe —e **\(t—2)"—! dt 


a (ee) 
= Te: | e~**(t — x)" dt. 
r(1 +r) m 

The integral is then transformed as in the preceding case. 

If r is a negative integer, then I” is the operator of r-fold differentiation, 
and verification of (2.8.8) does not involve difficulties. 

But if r is negative and not an integer, then, choosing a positive integer n 
such that —1 < r +n < 0, we can factor the operator, I” = [*t"]-" with 
the result that 


I” exp(—zz) = pie exp(—zz) 
= exp(2mn)z"I"*" exp(—zz), 


i.e., We arrive at a case already treated. 
The verification of (2.8.10) is by analogous arguments. 


THEOREM 2.8.2. Letz >O, and leta # 1 and 6 be some pair of admissi- 
ble parameter values. Then for any r > —1/a the function w(x) = zg(z, a, 6) 
ts the real part of a function x(£), € = 17°, satisfying the equation (7:75) 


rl~°"(2~*x(x~*)) 


= exp(—imr + ima(1 — 6)r/2)é"I-"x(€). (2.8.12) 
PROOF. Consider the function 
x(€) = F exp(it — Et% exp(irað/2)) dt. (2.8.13) 
0 


A simple transformation of the integral in (2.2.1a) shows that u(r) = 
Re x(€). We show that y(€) satisfies (2.8.12). Using (2.8.8), we find that 


I~"y(€) = f exp(it)I ~" exp (—&t® exp(irað/2)) dt 
0 
= exp(inr + i7a6r/2) 


x F t™ exp(it — €t* exp(i7a0/2)) dt. (2.8.14) 
T Jo 
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Further, since 
pa) =a") 
= X J explitz — t® exp(inað/2)) dt, (2.8.15) 
0 


T 
it follows from (2.8.9) that 


sr pie} = f exp(—t® explinrað/2))I 7% exp(itz) dt 


oO 
= exp(imar/2)= J t°" explitz — t® exp(irað/2)) dtz 
0 


= €" exp(inar/2)= J t" explit — €t* exp(imaO/2)) dt. (2.8.16) 


Comparison of (2.8.14) and (2.8.16) confirms (2.8.12). The condition ar > 1 
ensures the existence of the integrals in these equalities. 

A slight change in the arguments enables us to get another equation of type 
(2.8.12). 

Let us integrate by parts in the expression (2.8.13) for the function x(€): 


we F AY 
xE] = rro exp (i300 1)) 
oo 
al "z  ¢70 T 
«| t*—* exp (it Et™ exp (i708) ) dt. (2.8.17) 
Let 7(€) = €~'x(€). The functions 7, x, and y are connected by the 
following relations: 
r(€) = 2%x(z7%) = 2 ua), 
ülo) = €7%x(€) = Enere), 
and the differential operators with respect to the variables x and € are con- 


nected by the equalities 
d 


i = - gitat, = = — argh ee. (2.8.19) 

THEOREM 2.8.3. For any r > —1 and any pair of admissible values of 
the parameters a # 1 and 0 the function z1+%9(z,a,0) is the real part of a 
function T(E) satisfying the equation 


EtL" T(E) + £1 (1) exp(im(1 + r)) 
= aexp(iar — iZa(r + 1)(1 — 9)) Pe t+) g targy) (2.8.20) 
PROOF. According to (2.8.17), 


r(€) — €"*/in = = exp (iza) x $e"? exp (it — €t* exp (508) ) dt. 


(2.8.18) 
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From this, using properties (2.8.8) and (2.8.11) of the operators J’, we find 
that 


I~" (r(€) — Et fir) 
= IAHE = explirr)r(1 + rje- (+r) 


a Tt 
= = exp (inr + 15 a8(1 + r)) 


"airi 4 _ ĉa Sl 
xJ t exp (it Et™ exp (i508) ) dt. (2.8.21) 
On the other hand, according to (2.8.15) and (2.8.9), 


P)-or+)) u(x) = aa E Uta) i C a) 


= exp (iZ (a(r +1)- 1)) 
1 


oo 
x F pairt- ep (itx — t° exp(imad/2)) dt 
0 


= £'T! exp (iF (a(r +1)- 1)) 
T [ 7 t*("+)—1 exp(it — Et% exp(imab/2)) dt. (2.8.22) 
Comparison of (2.8.21) and (2.8.22) gives us (2.8.20). 
COROLLARY. Let r=1/a—1. In this case (2.8.20) simplifies: 
P"r(&) + iT(1/a)exp(ir/a)g-" ° 
= aexp(tm(1 — a)/a — in(1 — 0)/2)Er(E). (2.8.23) 
We single out another corollary as an independent theorem, since it is 


connected with the case when the integrodifferential equation (2.8.20) becomes 
an ordinary differential equation (2-26). 


THEOREM 2.8.4. If a= p/q is a rational number different from 1, then 
for any pair a,@ of admissible parameter values the density g(x,a,0) is the 
real part of a function u(x) satisfying the equation 


Zi 
q 1+p/q 4 ; 1+p/q 
qi (sites) (mtua) 


p—1 
= p? exp (-iZp0 - 0) = (=) p(x) + jE FD ote (2.8.24) 


The proof of (2.8.24) reduces to a transformation of (2.8.20) in the case 
when a = p/q and r = q — 1. The passage from the variable £ to z is by 
means of (2.8.18) and (2.8.19). With their help, equation (2.8.24) can be 
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given another form if we write it for the function 7(€) and pass from the 
variable x to £: 


PE (4) r(e) + i" Trag 


= p+q tl 6 1+a/p 4 Po 1+q/p 2.8.25) 
= (-1)?*%p? exp (iZ p(1- 0)) € =) (Elt4/Pr(e)). (2.8. 


It is not hard to see that the variant (2.8.24) looks simpler than (2.8.25) if 
q < p, and more complicated if q > p. This is very clear, for example, in the 


case p = 1, when (2.8.25) is transformed into the equation pae 
q—1 
a(Z) (6) +(-1)*ex (1501-0) er) 
$ pee =), (2.8.26) 


The above equations, beginning with (2.8.12), have a common feature. 
They are connected not with the density g(z,a,) itself but with a certain 
complex-valued function whose real part is expressed in terms of the density. 
This means that each of these equations is, generally speaking, a system of 
two equations for two functions, of which only one is of interest to us. The 
linear nature of both equations allows us to write an equation for each of 
these functions, but only at the cost of complicating the equation. Thus, the 
order of equations (2.8.24) and (2.8.25) is equal to max(p — 1, q — 1), while 
the order of the equation for the density g(z,a,6) = Re u(x) in the general 
case is 2max(p — 1, q — 1). Sometimes, however, the complication does not 
occur, because the coefficients of the operators turn out to be real. 

We consider this phenomenon in more detail for the example of equation 
(2.8.20) and its particular cases (2.8.24) and (2.8.25). The number 

exp (inr — iZa(r + 1)(1 — 0)) 
is the factor in (2.8.20) with which we shall be concerned. It is real if and 
only if the number 2r — a(r + 1)(1 — 9) is even, i.e., is equal to 2k for some 
integer k: 
2r — a(r + 1)(1 — 0) = 2k. 
From this we obtain 
6=1-2(k-—1r)/a(r +1). (2.8.27) 

Determination of the numbers @ satisfying (2.8.27) means requiring that 8 
belongs to the domain of admissible values, i.e., |9| < ĝa. It is not hard to 
compute that this is equivalent to the conditions 


r<k<r+a(r+1) if0<a<l, 
(a—-1)(r+1)t+r<k<2r4+1 ifl<a<z2. 
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For equations (2.8.24) and (2.8.25) with a = p/q #1 andr = q—1 the 
representation (2.8.27) is equivalent (since r is an integer) to 


0 = 0) = 1 2k/p (2.8.28) 
with the following conditions on the integer k: 


O<k<p if p <q, 
P-qQ<sk<q ifp>g. 


For each pair a = p/q, 0*) satisfying (2.8.28) equations (2.8.24)-(2.8.26) split 
into pairs of mutually unconnected equations for the real and imaginary parts 
of the corresponding function. Furthermore, the most interesting equation for 
the real part is always homogeneous. 

It is not hard to see that for each a = p/q # 1 there are at least two values 
0 satisfying (2.8.28): 0 = 6, and 0 = —6,. The set of a = p/q # 1 for which 
there are no cases of splitting of the differential equations other than cases of 
extremal laws is made up of numbers of the form 1 /(n+1) and (2n+1)/(n+1), 
R= 1,2... 

If the order of the equations is at most two, then the form of the equations 
themselves allows us to hope that they can be solved, at least with the help 
of some special functions. Let us consider these cases. 

1. If a= 4 and @ is any admissible value, then from (2.8.26) we get 


T(E) — 4 exp(imp)Er(€) + 1€-?/n = 0. 


This equation can be solved without difficulty when we recall that r(co) = 0: 


4 be imp i P imp dt 
e= Zexp (Le ) . exp ( 1° 2° 


Consequently, 


23/2 9(z, 1/2,0) = Rer(€) 
= -iim (cexn(27/4) f exp(—/4) 5). (2.8.29) 


z 


where z = £71? exp(izp/2). 
In particular, if 0 = 0 (i.e., p = 5 which corresponds to a symmetric 
distribution), then after suuitable transformations (2.8.29) takes the form 


*(34) ia [ee eG) 
ran z G -5 E) | (2.8.30) 
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where 
1 “ cost sin t 


1 u 
= Til vi sw = Te f ve 


‘ea 


C(u) 


are special function called Fresnel integrals 
2. If a = 4 and 0 = 1 (a case in which equation (2.8.26) splits), then for 
y(€) = Rer(€) we get the equation 


V lE — 3€y(€) = 0. 


This equation is solved (to within a constant multiple) by the so-called Airy 
integral, which can be expressed, in turn, in terms of the Macdonald function 
of order 3: 


I(E) = 3e [ cos(t? + tE) dt = eV Ets (26°/7/3V3). 


To establish the value c we are aided by the expansion (2.4.8) of the function 
r*/3g(x, 4,1) in powers of € = 2-1/3, On the one hand (see 8.4.32(5) in [27]) 
VEK 1/3 (269/?/3V3) => V3T(1/3)/2 as € +0, 


and on the other hand 


_ 4/3 1 \L 1 v3 (5) 
y(€)=2 a (251) ae a : 


as € — 0; therefore, c = 1/37. Consequently, 


1 ls da a 
= = —— ee aa /2 
g (2 7’ 1) 37 Ky1/3 (a ) : (2.8.31) 


There are four more cases when the density g(x, œ, 0) can be expressed in 
terms of special functions. 

3. If a = 2 and 9 = 1, it follows from (2.8.25) that y(£) = Re T(€) satisfies 
the equation 


y" (£) + $E" (E) + F Eyle) = 0. 
4. If a = 2 and 6 = 0, then for y(£) = Rer(Ẹ) we have that 
y" (€) — $E°y' (E) — F Ey(€) = 0. 
5. Ifa = 3 and @ = 63/2 = 3, then, by (2.8.26), the function y(x) = Re u(x) 
is a solution of the equation 


y" (x) + 427y!/(z) + Pay(z) = 0. 
6. Ifa = 3 and 0 = -4, then for y(x) = Re u(x) we have 


y" (z) — gx°y'(2) - p rylz) = 0. 
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Recall that in the last two cases the value 6 = 3 corresponds to the value 


B = —1 in the forms (A) and (B), while 6 = -4 corresponds to the value 
p = 1 in the same forms. 

All these equations are connected with a single type of special functions, the 
so-called Whittaker functions Wk m(x). We confine ourselves to a treatment 
of the particular case a = z, 0 = 0. The other cases are handled similarly. 

It is known (see, for example, 2.273(5) in [39]) that in the chosen case a 
function of the form 


y(€) = CE! exp (2) W-1/2,176 (483) 
is a solution of the equation. As in the analysis of the preceding case, the value 
of the constant C can be determined by comparing the asymptotic behavior 
of y(€) as € = x~*/3 — oo, obtained in two different ways. On the one hand 
(see 9.227 in [27]) 


y(f) ~C 
On the other hand, on the basis of (2.2.11) 


2 2 3 
— pl+2/3 4 a wi Pays á = 14+2/3 
y(€) T g (« 5,0) T g (o 5,0) An 
This implies that C = 1/(2v3r), i.e., for any z > 0 


2 2 
g (« 50) S (-z 2) 


g! 2 43 
= — . 2.8.32 
2/37 exp (fe ) W_ 1/2,1/6 Ga ) ( ) 


In the remaining cases we obtain in the same way the following expressions 
for the densities on the semi-axis x > 0 (?-?9): 


eh ers eee 4? (2.8.33) 
g Tig» oa Jar p 27 1/2,1/6 27 ; 0. 

31 g 2 43 

ey eee = 2.8.34 

3 1 as 2 4 

~,—— W_ ; 2.8.35 
g g 9° z= T, m TE 5 1/2,1/6 san ) ( ) 


According to the duality law (2.3.3), the densities under consideration on 
the positive semi-axis must be connected by the equalities 


xg (xz, 3,4) =2~9/%g (yan, 3,1) : 


xg (x T, 3, —3) et gay (a8, 3,0) ) 


S478 3v3 
Ea (1+3/2) _ ee, 
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and this can indeed be observed in the above formulas (therefore, in partic- 
ular, we can obtain (2.8.35) as a consequence of (2.8.32), and (2.8.33) as a 
consequence of (2.8.34)). 

The integral representation 


ge j2 [ .—t41/6 | ion 
W_1/2,1/6(2) = T(7/6) 7 J e l (1 T *) dt 


of the function W_1/2,1/6(z) in the z-plane with a cut along the negative 
semi-axis (see 9.221 in [27]) shows that this is a multi-valued function with 
third-order algebraic branch points z = 0 and z = oo. Consequently, the 
function W_4/21 7e(2°) is analytic in the whole complex plane, and we can 
speak of its values W_1/21 Je(2°) on the negative semi-axis, although the 
formula given is unsuitable for expressing them. This consideration, together 
with the consequence 


g (z, 3,3) =g (—2, 3-3) 


of (2.1.3), leads to new expressions for the densities g(z, 3, 4) and g(z, 4, 1): 
Ji g7! 2, 4, 
a (233) m N3 exp ( 5 T ) Wasa (-57 ) ’ 
2 g? 2 4 
=,1 =a IW. -—r*). 
g (2 3? = 2/37 exp (5r ) 1/2,1/6 ( 377 ) 


A comparison of these expressions with (2.8.34) and (2.8.33) shows that in 
the complex plane 


W_1/2,1/6(—2°) = —2W_1/2,1/6(2°). 
We conclude this section with another theorem. 
Consider the function 
_ fzg(z,a,8) ifaf1, x>0, 
Salg, n) = Fay ifa = 1, 8 >0, 
where the variables € and n are connected with x and 8 by the relations 
Be — log x ifa #1, 
-~ |a2/B-logG ifa=1, 
_ { (r/2)8K(a)/a ifa#1, 
228 if w= I. 
Let z = + in and define the function Ya(z) as follows: if 0 < a < 1, then 


Tal) = : i exp (-t — t* exp (-i50 + az) ) dt; 
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if 1 < a < 2, then 


i 
Pal) = F exp(ite * — z — t“) dt; 


and if a = 1, then 
(z) = if exp (- (5 +2) t — tog t) dt. 


Returning to the representations (2.2.8)-(2.2.10) of the densities g(x, «a, 8), 
we see that 
Salé y) = Real). (2.8.36) 


It is obvious that Ya(zx) is an entire analytic function for all a. 
We thus arrive at the following assertion. 


THEOREM 2.8.5. For each admissible value of a the function Sa(£, n) is 
a solution of the first boundary value problem of the Laplace equation AS, = 0 
in the strip 
-00 <E<co, -70,/2< 9 < x0,/2 


for the case a #1, and in the half-plane —œ0 < € < œ, n > 1/2 for the case 
a=1. Furthermore, the boundary conditions are 


Sa(+00,7) = 0, 
Sa(€,+70q/2) = e~*g(e~®, a, +sgn(1 — a)) 
for a#1, and 
Si(+00,n) = Si(§,00)=0,  Sı(£, 7/2) = g(€, 1,1) 
for a= 1. 


REMARK. Here the appearance of the domain of variation of the variables 
€ and 7 is connected, of course, with the restrictions to which the parameter 
p is subjected. But if S.(€,7) is defined directly as the real part of V,(z), 
then it is harmonic at each point of the (£,7)-plane. 


§2.9. Stable laws as functions of parameters 


Of the four parameters a, 3, y, and À determining the distributions of class 
G, the last two play the role of shift and scale change parameters. It is clear 
from property (2.1.1) that 


G = G(z, a, 8,7, A) = Glia- DAYS a, B,0, 1), 


where l is uniquely determined by the parameter values. Therefore, for fixed 
a and p the distribution G admits a representation as a superposition of a 
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standard distribution G(z,a,) and a linear function (rz — DA- with a 
relatively simple dependence of l on the parameters. Thus, the study of the 
properties of the distribution G as a function of y and à reduces in fact to 
the study of the properties of G(x, a, 3) as a function of z for fixed a and 8 
and as a function of z,a, and ( in the general case. 

It follows from Theorem 2.8.5 that the densities g(x, œ, 8) are entire func- 
tions of the modified variables € and n, which are variables on an equal footing 
in the sense that they are the real and imaginary parts of the single variable 
z in (2.8.36). 

An analysis of the function Ya(z) shows that its order as an entire function 
is infinite for any a. This indicates that S,(€,7) also has infinite order as an 
entire function of € or 7. 

The integral expressions for V,(z) enable us easily to obtain a series expan- 
sion of it in powers of z, and from that it is the very easy to find an expansion 
in a double series of powers of € and n, after which separation of the real part 
of the double series gives us an analogous expansion of the function S,(£,7) 
in a double series. However, we do not give these fairly unwieldy formulas 
here; they are not hard to get by following the remarks above. 

Regardless of the choice of the parametrization system, the distributions, 
in the family G are continuous with respect to the parameter a in the in- 
tervals (0,1) and (1,2). The distributions G(z,a,() corresponding to the 
parametrization systems (M) (for the whole family S), (C), and (E) (for the 
subset W of it) turn out to be jointly continuous in all variables in the domain 
where they vary. The question of the behavior of stable distributions when a 
is near zero and of the distributions Gyy(z,a, 8) near a = 1 merit extra at- 
tention here, the latter in connection with the desire for good approximations 
of the distributions Gm (zx, a, 8) in terms of Gy(z, 1, 8). 

Let 0 < a < 1, and consider random variables Y(a,@) = Y(a,,0,1), 
0 = BK(a)/a = ß, and Z(a, p), p = (1+6)/2, with the respective distributions 
G(z,a, 3) and (G(z, a, 8) — G(0,a,8))/p. Denote by Ug a random variable 
taking the values +1 with respective probabilities (1+6)/2, and by E a random 
variable independent of Ug and with the standard exponential distribution 
(i.e., with mean value equal to 1). 


THEOREM 2.9.1. The following relations hold for any sequence of pairs 
of admissible values a < 1 and p' = (14+ 6’)/2 such that a > 0 and p' — p= 
(1 + 6) /2: 


Y%(a, 6’) = |Y (a, 0’)|% sgn Y (a, 0’) +, Ue/E, (2.9.1) 


Ze (a, p') $ 1/E. (2.9.2) 
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NOTE. A modified concept of power is used on the left-hand side of (2.9.1), 
and it is not understood (as we stipulated in §2.1) that the separately written 
random variables |Y (a, 6’)|* and sgn Y(a,6’) are independent. There are 
more details on the modified power and on the random variables Y(a,@) and 
Z(a,p) at the beginning of Chapter 3 and in §3.1. 

PROOF. The elements of the characteristic transform of the distribution 
of Y*(a, 6’) have the form 


1, _ C08 5(k — as6’)T(1 — s) 7 
aL a cos 5 (k — as)I (1 — as)’ eee 
For fixed —1 < s < 1 it is obvious that as a —> 0 and 6’ > 0 
wo(as, a, 6’) + T(1—s) = wols), 
o( ) (1 — s) = wo(s) (2.9.3) 
wi(as, a, 6’) > Or (1 — s) = w (8). 


We show that the pair wọ, wı corresponds to the characteristic transform of 
the random variable U,/E. Indeed, 


E|U,/E|* = E(E-')* = f a” e de = wols), 
0 


E|Ue/E|* sgn(U/E) = E(E~')% sgn Up = (E sgn Ug)(E(E~')°) 
= 8wo(s) = wi(s). 
Consequently, (2.9.3) is equivalent to (2.9.1). The relation (2.9.2) can be 
proved similarly (?:°°). 

The distribution functions of the random variable Y°%(a,0), the cutoff 
Z°(a,p) of it, and their limiting random variables Ug/E and 1/E as a — 0 
are as follows: 

F,(z) = G(|z|!/% sgn a, a, 8), z€ R’, 
F(z) = 4[G(x/*,a,p)-(1—p)],  2>0, 
Fo(x) = $(1— 0 + (sgn z + 0) exp(—1/|z|)), zé R’, 
Fo(x) = exp(—1/z), x > 0. 

The relations (2.9.1) and (2.9.2) can be refined by constructing asymptotic 

expansions of F(x) and F(z) in series of powers of a as a — 0. To do this 


we need to introduce some new notation. Let a = ai!) = bi = —C, where 
C is the Euler constant, and let 


8 


27(2” — 1)!-F| Bn] ny” a ¢(n) 
Men prg a U, 
_ 21Bal a yf OU) 

* nT (n+ 1) ae are 
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for n > 2 and k = 0,1, where the B, are the Bernoulli numbers, and ¢(n) is 
the value at n of the Riemann ¢-function. With the help of alf and bn we 
determine the quantities Ch (©) and Da by 


exp (Š ae") = 
n=1 


exp (È hs") 
n=1 m=0 


Clearly, the quantities D,, are formed from the quantities b, by the same 
rule as the C\*) are formed from the aP. In both cases the so-called Bell 
polynomials Cn (u1, ..., Uun) realize this rule (an explicit expression for them 
is given in §3.6). Namely, 


ch) = Cp(1lafP,... nla), 
Dn = Cn(1!b1,...,n!bn). 


We define the polynomials Pa (z), n = 0,1,2,..., by the recursion relation 
Po(z)=1, Patile) =2Pa(z)+2Pi(z), n20. 


It is not hard to compute that the generating function of these polynomials 
is equal to 


2 Pae) ig = exp(z(2* ~ 1): 


Consequently, P,,(x) can if desired be interpreted as the nth-order moment 
of the Poisson distribution with step 1 and mean value z > 0. 


THEOREM 2.9.2. Let —1 <0 < 1 and z E€ R! be fized numbers. Then 
the following asymptotic erpansions are valid as a — 0 (%31): for any real 


A0 


3 
II 


and for any x > 0 


Ê (z + oD, P,,(—1/z) exp(—1/z)a”/n!. (2.9.5) 
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PROOF. We transform the function wz(as,a,0) by using the quantities 
a\*) introduced above: 
cos 5 (k — abs) T(1— s) 


cos 5 (k — as) T(1 — as) 


exp (S 0000)" r(1- s) 


wk(as, Q, 0) = ka 


= oF > ote (os)” SI" T(1- 9), k=0,1. (2.9.6) 


n=0 
Here, 6° = 1 if 0 = 0; see §3.6 about the transform (2.9.6). Adding wo and 
wı and subtracting wı from wo, we get that for —1 < s < 1 


7 8 = = 0) 1 a” n 
2f mL te) 5o (ct + 605 8 r(1- s8), 


N ne (2.9.7) 
8 a 0 a” n 
2 f (2) dF, (2) = Dies ) — 900) srr(a — 3). 


The convergence of the series is a consequence of the analyticity in the disk 
|s| < 1 of the functions multiplying I'(1—s) in the expression for wz(as, a, 0). 
Further, 


Since 


it follows that 
s"T(1—s) = f Hdmi- 
0 
In a completely analogous way, 


sra-s)= f " (—2)* d[Pa(1/2) exp(1/2)]. 


= 00 
After inversion of the Mellin transforms, these equalities together with (2.9.7) 
to the formal equality (2.9.4). The formal relation thus obtained has the 
rigorous meaning of an asymptotic expansion. This fact is proved by the 


164 2. ANALYTIC PROPERTIES OF DISTRIBUTIONS IN G 


standard method used in such cases. The proof of the next result, Theorem 
2.9.3, in which an analogous problem is solved, can serve as an example. 
It should just be taken into account that by setting s = zt (t € R!) we can 
interpret the Mellin transform as a characteristic function. The inverse Fourier 
transforms of the individual terms exist because of well-known properties of 
the function I: 


IT (1 —it)| ~ V2rlt|!/? exp(—z|t|/2) as |t| > oo. 
The relation (2.9.5) can be proved with analogous arguments by starting 
from the equality (see (2.6.20) and (3.0.2)) 


sintpas [(1—s) 
psin tas (1 — as) 


= exp Š n r(1- s8) 


aa — Lil = 9), 


M (as, a, p) = 


The asymptotic behavior of the distribution Gag(z,a,@) near the point 
a = 1 (where it is continuous, as mentioned in §I.3) will be studied on the 
basis of the characteristic function gy(t,a,@) corresponding to it. 

Let us consider the difference 


A(t) = log gm (t, a, 8) oa log gm(t, 1,9) 
= —|t|(|t/*~* — 1) + 2t(|t/*~* — 1) A tan Za + it 8? log |t]. 


For brevity we let € = a — 1 and expand the last expression in a series of 
powers of €: 


A(t) = —|t|(exp(€ log |t|) — 1) + it log |t|8 (2 — ecot Ze) 
— at(exp(é log |t|) — 1 — e log |t|)G cot Fe 
- (|t| log |t| + itß t log? |t|)e +- 


oO 
= — -J ine. 
n=1 


Consequently, 


gm(t, a, b) = gm(t, 1, B) exp @ Ss me") 


m(t, 1, B)(1 — hie — (2h2 — h?)e? /2 —---). 
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Formal inversion of this equality yields an expansion of the density g m(2, @, 3) 
in an asymptotic series of powers of £ as € — 0: 


gm (zx, a, 3) ~ gm(z, 1,8) — Hı (z, Be —---. (2.9.8) 


Although the functions H;, H2,... can be expressed in terms of the distri- 
bution Gy(z, 1, 8) and its multiple derivatives (including mixed derivatives) 
with respect to the variables z and (3, these expressions are so unwieldy that 
we confine ourselves to computing only the first term of the asymptotic series 
(2.9.8) (73%), i.e., the function H;(z, 3) corresponding to 


hı = |t| log |t| + it? log? |t]. 


THEOREM 2.9.3. Letz e R! and -1< B<1. Thenasa—1 
(gm (z, a, 8) — gm (az, 1,8))(1 ey — H;,(z, ß) 


x, 0? 
= [bf pOm(z 1,6) + 233 m(z,1,5) 


T 
- gm(2,1,8) - (Z2 +8) zome, 1,8). (299) 
PROOF. By the inversion formula for the densities, 


Ba = gm (z, a, b) > gm (Zz, 1, B) 
e~"*(gas(t, a, 8) — gmlt, 1, 8)) dt 


N 1 g ey 
=> u(t, 1, B)A(t) dt + 6, (2.9.10) 
where 
1 —it 
= itz g =j- 
= m(t, 1, B)(e4 A) dt 


Let us estimate |6|, considering that |e] < 4. To do this we first estimate 


|A(t)|: 


|A] < max((¢], |¢|?/*)| log |t| |e + |t] | log |t| |e? + max(|¢], |t|3/?) log? |t\e 
< 3|t|(1 + lt] + log? itle. 


Since |gm(t, 1, 8)| = exp(—|t]) and Re A = —|t|® + |t|, it follows that 


lam (t, 1, 8)(e* — 1 — A)| < exp(—|t|) max(1, exp Re A)|A|?/2 
< max(exp(—|t|), exp(—|t|*))|A|?/2. 
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Consequently, |6| = O(e?). Further, it is not hard to compute that 
A(t) = —hy(t)e + n[(1 + t?)(1 + logt |t|)]e? 


where |n| < const. 
From this and (2.9.10), together with the estimate for ó, we get 


Ba = -g | gmlt 1 8)h (i) dt + O) 
= —H,(z, B)e + O(e?). 


It remains for us to compute the integral H(z, 3). According to (2.1.2), for 
any A > 0 


gM = gMt, 1, 8) = ett dGm (5 = * Blog, 1, 3). 


This implies that 


sa 
1 a - T g (5 - * Blog 41,8) — D(a) dz, (2.9.11) 


where D(x) = (1+ sgn z)/2. 
For brevity let 
y = -log gm(t, 1, 8) = |t| + it2 Blog |t|. 


Differentiating both sides of (2.9.11) with respect to À and then letting A = 1, 
we arrive at the equality 


¥ exp(- ph) = ie (z + 6) gm(z, 1, 8) dz. (2.9.12) 
We now differentiate both sides of (2.9.12) with respect to ĝ: 

2 2 

zi z a = 

z log |t|e = log |t|we 


On the other hand, setting à = 1 in (2.9.11) and differentiating both sides 
with respect to 8, we get 


Z log tle = [ee * Gm (tI, 1, 8) dx (2.9.14) 


Repeated differentiation of (2.9.14) with respect to 8 gives us that 


$: 2 — itr 9? 
--zftlog \tle~¥ = fe age OM (2, 1, 8) de. (2.9.15) 
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Equalities (2.9.13)—(2.9.15) enable us to represent as a Fourier transform the 
function 


Gm (t, 1, 3)hi(t) = log |t|be~ — (B/r)it log? ltle~”. 
This yields the expression (2.9.9) for the function Hits, 8). 


§2.10. Densities of stable distributions 
as a class of special functions 


Many facts indicate that, by virtue of their richness of analytic properties, 
the functions g(x, a, 8) merit being distinguished as an independent class and 
accorded “civil rights” in the theory of special functions. Some of these facts 
are presented below. 

The functions g(z, a, 1) with 0 < a < 1 turn out to be useful in the theory 
of Laplace transforms and the operational calculus connected with it. Let 


Vis) = A exp(—sz)v(z) dz, Res > 0, 


denote the Laplace transform of a function v(x) (briefly: V(s) = v(z)). The 
following connection formulas are well known (for example, see [19] or [15]). 
If V(x) — v(z), then, for instance, 


V (vs) = p i exp (-<) v(u) du; (2.10.1) 
for any number c > 0 
1 "e u? 
V (es + ys) - NAA u(x — cu)~3/? exp (a) v(u)du. (2.10.2) 


The functions g(z,a,1) with 0 < a < 1 provide an opportunity for gener- 
alizing these relations. 


THEOREM 2.10.1. IfV(s) — v(x), then the following relations hold for 
any0<a< 1 and any c > 0 (233): 


V (8%) = u71 agru e a, 1)v(u) du, (2.10.3) 


z/c 
V(cs+s%) - f u~! agn Vag — cu), a, 1)v(u) du. (2.10.4) 
0 


si 
In particular, for a = 3 


a N uwt ( : uta) v(u) du (2.10.5) 
er TE — 10. 
0 3v3 


V (s1/3) — Te 
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i E i u sie 
V (cs + 81/3) = x | ( =) 
3 pe 


2 u? ue 


where K,/3 is the Macdonald function of order 1 js. 


PROOF. By (2.6.10) and (2.1.2), for any z > 0 
exp(—zs*) = J eg VWag(g-V/euy, a, 1) du. (2.10.7) 
0 


Using this equality and the integral expression for V(s%), we obtain (2.10.3) 
after changing the order of integration. 
Further, by the same equality (2.10.7), 


V(cs+ 8%) = J exp(—csy — s*y)u(y) dy 
0 


=f J exp(—s(cy + u))y7!/“g(y~*/%u, a, 1) dy du. 
0 0 


The change of variable u = x — cy gives us an integral of the form 
oo 
J exp(—sz)U(z) dz, 
0 


where the function U (z) coincides with the right-hand side of (2.10.4). 

The relations (2.10.6) and (2.10.7) are obtained from (2.10.3) and (2.10.4) 
after we replace g(u, 4,1) by its expression in (2.8.31). 

We say in §2.8 that the functions g(x, a, 0) are solutions of various types of 
integral and integrodifferential equations, and even of special types of ordinary 
differential equations in the case of rational a 4 1. And though the function 
g(x, a, 0) represents only one of the solutions, a detailed analysis of its analytic 
extensions may possibly reveal (as can be seen by the example of an analysis 
of the Bessel and Whittaker equations) other linearly independent solutions. 

Here it is apparently the equations with the densities g(z,p/q,1) (p < q) 
as solutions which should be considered first and foremost. 

A good illustration is the case p = 1 and q = n + 1 > 2 connected with the 
equation (see (2.8.26)) 


y™ (E) =(-1"Ey(E)/(n +1), €>0, 


which as a solution the function 


y(€) = EF Dg(E-—41/(n + 1), 1). 
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Of interest is the connection between the densities of extremal stable laws 
and the Mittag-Leffler function (1:5) 


o0 r” 
Bolt) = 2 rae FI) g >00. 
THEOREM 2.10.2. For any0 <a < 1 and any complert s 
OO 
aEa(—s) =| exp(—sz)z~1—M/%g(¢7-1/o a, 1) dz. (2.10.8) 
0 
If 5 <@< lL. tien (C 
Co 
ak,(—s) = J exp(—sx)g(zx, 1/a, —1) dz. (2.10.9) 
0 


PROOF. Let s > 0. It is known [4] that in this case the function E.(-s) 
has the representation 


dz 


-n 1 1/a 
Eal) = ria | Pte ) (2.10.10) 


gta 


where the contour of integration L consists of the following three parts (z = 
zT + ty): the line Lı given by y = —-(tan p)z, where z varies from zt = +00 
to x = h, with h > 0 and ma/2 < p < ra; the circular arc Lz given by 
|z| = h/ cosp, —p < argz < p; and the reflection L3 of Lı with respect to 
the x-axis. We replace (z + s)~! in (2.10.10) by the equivalent integral 


f exp(—(z + s)u) du. 
0 


The double integral thus obtained converges absolutely; hence we can change 
the order of integration. Then 


Eaļ(—s) = p exp(—su) falu) du, 


where the function falu) can be transformed by integration by parts: 


1 
falu) = aoe f exp(z!/* — zu) dz 
1 1/a 1/ 
— e -gu jdr", 
r | PC zu) dz 


We make a change of variable, setting z = ¢%/u: 


a a 1 a —1/a 
falu) = >u (= | exp(—¢% + ¢u7}/ jas). 


Q 


The contour L* is the image of L under this change of variable. The integral 
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in parentheses is the inverse Laplace transform of the function exp(—s®) (0 < 
a < 1) and, consequently, represents the function g(u—!/%, a, 1) according to 
(2.6.10). 

The equality (2.10.9) is obtained from (2.10.8) by using the duality law for 
the densities (2.3.3) and the fact that the pair a < 1, 8 = 1 in the form (C) 
corresponds to the pair a’ = 1/a, 3 = —1 in the form (B). 

The left-hand sides of (2.10.8) and (2.10.9) can be extended analytically 
from the half-line s > 0 to the whole complex plane, as is clear from the 
definition of the function E,(z). On the other hand, 2~!~1/%g(x~1/2, a, 1) 
decreases with increasing z more rapidly than any function exp(—cz), c > 0. 
Therefore, the right-hand side can also be extended to the whole complex 
plane, i.e., the equalities (2.10.8) and (2.10.9) are valid for any complex s. 

Another interesting connection between the functions E,(z) and g(z, a, 1) 
reveals itself when the following equality (obtained in [32] for any ø > 0) is 
generalized: 


E,/2(z) = -T [ Es (zu ) exp (-) du. (2.10.11) 


THEOREM 2.10.3. Suppose that0 < a < 1 ando > 0. Then for any 
complex s 


Lg lS). = J Es(su7®™ )g(u, a, 1) du. (2.10.12) 


PROOF. We assume first that s = —z < 0 and compute the Mellin trans- 
form of the left and right sides of (2.10.12). 

Let 0 < a <1and0 <p < 1. Using the representation (2.10.8) and the 
explicit form (3.0.2) of the Mellin transform M(s,a,1) of g(z,a,1), we find 
that 


i: 2?-1 BE, (—2) dz = zf aaaea gig a, 1) du | et gP—! dr 
0 a Jo 0 
=T(p) | ual- Gu", a, 1) 
0 
_ T(p)PU—p) 
T(1—ap) ` 


Consequently, the Mellin transform of the left-hand side of (2.10.12) is equal 
to ['(p)'(1—p)/I'(1—aop). On the right-hand side we have the M-convolution 
of two functions (see Chapter 3 about this). Therefore, if 0 < ø < 1, then the 
Mellin transform of the right-hand side is equal to 


P(1—op) T(p)rU - p) _ Pp) -= p) 
[(1—aop) T(1-— øp) r(1- ap) — 
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Thus, the Mellin transforms of the right- and left-hand sides of (2.10.12) 
coincide. Consequently, (2.10.12) itself is true in this particular case when 
0 <o <1. It now remains to note that E,(s) represents an analytic function 
of ø in the half-plane Reo > 0, and we can thus extend (2.10.12) not only to 
the whole complex s-plane but to the whole semi-axis g > Q. 

There is one more consideration which speaks for regarding the densities 
g(x, a, 3) as a class of special functions. The so-called Airy function (or Airy 
integral) is well known in the theory of special functions. We have already 
encountered this function at the end of §2.8. It has the form 


A(z) = F cos(t? + tz) dt = k J exp(-it? — itz) dt (2.10.13) 
T Jo 2T 


and can be expressed in terms of Bessel functions as follows: for z > 0 


a= [a (292) -r (F) 


3V3 3V3 ai 
= uel a (55) > 0, 
Ao) = EE] 


The asymptotic behavior of the functions J.(z) and K,(z) is well known 
(see 8.351(1) and 8.452(6) in [27]), and it makes it possible for us to trace the 
behavior of the function A(z) as z — oo and z > —oo. Namely, as z > 00 


(32)-*/4 22,/z 
Mel “Spee (~ Fy) 
(32)—1/4 2c./z r 
A(—z) Ag a (Sy = 7) . 


Thus, A(z) is positive on the semi-axis z > 0 and decreases very rapidly as 
rz — oo. At the same time, it is of variable sign on the negative semi-axis and 
decreases no more rapidly than a power of z. 

A formula for inverting the Fourier transform of the function exp(—7t?) can 
be seen in (2.10.13). But then 


J explita) A(x) dx = exp{ (it)3}. (2.10.14) 


It is clear from the asymptotic expressions for A(z) that the Fourier trans- 
form (2.10.14) admits an analytic extension with respect to t into the half- 
plane Imt < 0. But this is equivalent to the existence of the two-sided Laplace 
transform 


J ex(s2) A(x) dx = exp(s*) 
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in the half-plane Res > 0. This formula is very reminiscent of (2.6.10) in the 
case 1 < a < 2 and suggests regarding the function A(z) as a “trans-stable 
distribution” corresponding to the value a = 3. This interpretation is further 
corroborated by the fact that on the semi-axis z > 0 the function A(z) turns 
out to be connected, not unlike the duality law, with the stable distribution 
g(x, + 1): 

2A(z) = 2~°9(z*, §, 1). (2.10.15) 


The relation (2.3.3) embraces only the part of the stable laws with a < 1. 
The equality (2.10.15) suggests using the other part of them to define “trans- 
stable” measures (more precisely, charges) under the assumption that the 
densities A corresponding to them form a three-parameter set of functions 
with the property that 


Alei!* 0,8.) = a g(e 08) (2.10.16) 


for any z > 0, A > 0, a’ = 1/a < 1, and |6’| < 1, where 1 +0 = a’(1+ 6’). 
The fact that the right-hand side of (2.10.16) is an entire analytic function 
enables us to extend A analytically from the semi-axis z > 0 to the whole 
r-axis. 
The next section is devoted to the realization of these ideas. 


§2.11. Trans-stable functions and trans-stable distributions 


The class W of strictly stable laws is the starting object for the follow- 
ing generalizations. As we know (Theorem C.4), each law in this class is 
characterized in the form (C) by three parameters (a,6,A), or by the three 
parameters (a, p, À), where p = (1 + 0)/2. In this case the parameter A is 
purely a scale parameter, because 


gc(z, Q, P, A) = A Nagal Ne, Q, P, Íj. 


Therefore, without any detriment to the purposes of the projected general- 
ization we can set \ = 1 and operate with only the two parameters a and p. 
According to (2.4.6) and (2.4.8), the densities gc(x, a, p) can be represented 
by convergent power series. We use these representations here in a somewhat 
extended variant, including also the case a = 1. Namely: 


Iif0<a<1,0<p<l,andz>0,orifa=1,0<p<1,andz>1, then 


['(na + 1) 


: —an—1 
Tin +1) sin(mapn)x (2.11.1) 


= l = n—1 
go(z, Q, p) -= T D 
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Ifl<a<2,1-1/a<p<1/a,andz€ R!, or ifa=1,0<p<1, and 
lz| < 1, then 


sin(rpn)r”t, (211.2) 


It is not hard to see that the series (2.11.1) and (2.11.2) remain convergent 
if, keeping the restrictions on the variation of xr, we extend the domain of 
variation of (a, p) in the first case to the strip 0 < a < 1, p € Rt, and in the 
second case to the half-plane a > 1, p€ RŁ. 

It is thus possible to define the functions g(z,a,p), a > 0, p € R}, for 
the indicated values of z in the representations of the densities (2.11.1) and 
(2.41.3). 

This enables us, in turn, to define in the complex plane Z (possibly with 
cuts) the family of analytic functions 


T = {o(z,a,p):a>0, pe R}}, 


by setting them equal to the corresponding functions g(x, a, p) on the parts 
of the real axis r = Rez where the latter were defined. It is easy to verify 
that in the case a = 1 the analytic functions defined by means of (2.11.1) 
and (2.11.2) coincide if the parameter p is given the same value, i.e., o(z, 1, p) 
represents a single analytic function, independent of the way in which it was 
defined. 

We mention several simple properties of the functions in T. 

1*. If œ > 1, then, for any p, o(z, a, p) is an entire analytic function. 

2*. If a = 1 and p is not an integer, then o(z,1,p) is a meromorphic 
analytic function on Z with two simple conjugate poles that are solutions of 
the equation (see (2.3.5a)) 


z? +2zcosrp+1=0. 


The function a(z, 1, p) is equal to zero for p and integer. 
3*. If 0 <a<1,2>0, and ap is an integer, then alsa p =O 
4*. Ifa > 1 and z> 0 or ifa = 1 and0 <z <1, then 


a 


z~°o(z~*,1/a, ap) = rolt, a, p). (2.11.3) 


This equality is established by comparing the power series representing its left 
and right sides. 

Properties 3* and 1* enable us to understand what the functions o(z, a, p) 
are in the case when a < 1 and ap is not an integer. Indeed, by (2.11.3), if 
0 <a< 1 and ap is not an integer, then 


a(z,a,p) = x %o(r~%,1/a, ap). (2.11.4) 
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According to the property 1, the function o(z, 1 /a,ap) is entire, and z® is 
the elementary multivalued function on Z with branch points at z = 0 and 
z = œ. Consequently, the superposition of these functions on the right-hand 
side of (2.11.4), and with it the analytic extension of the left-hand side, i.e., 
the function o(z,a,p), is a multivalued analytic function on Z with branch 
points z = 0 and z=. 

In the case when 0 < a < 1 and ap is not an integer, o(z,a,p) wll be 
understood to be the principal branch of this function defined in the whole 
complex plane with a cut along the ray argz = 7 (the lower edge of the cut 
is included in the domain of definition). 

DEFINITION. Let To be the set of analytic functions formed by all the 
functions o(z,a,p) in T with a > 1 or with 0 < a < 1 and ap an integer, 
together with the principal branches of all the functions o(z,a,p) in T with 
0 <a <1 and ap not an integer. 

The functions in To are called trans-stable functions. 

Let us proceed to the properties of trans-stable functions (%35). 


PROPERTY 1. Each function o(z,a,p) € Zo is periodic in the variable p, 
with period Tẹ = 2min(1,1/a), t.e., 


o(z,a,p+ Ta) =0(z, 4, p). 
PROPERTY 2. For any function o € Yo 
o(z,a,—p) = —o(z, a, p). (2.11.5) 
PROPERTY 3. If a> 1, then for any complex number z 
o(—z,a, p) =a(z,a,1-—p), (2.11.6) 
zo(z,a,p) = z %o(z %,1/a, ap). (2.11.7) 


The left-hand side of (2.11.7) is an entire function, and the right-hand side 
is a superposition of principal branches of multivalued functions. If these 
functions are defined on complex plans with identical cuts, then their super- 
position has removable singular points on the whole curve of the cut; these 
singularities can be removed by appropriately extending the definition of the 
superposition. This gives an entire analytic functionon the right-hand side of 
(2.11.7). 

Properties 1-3 can be verified by elementary transformations of the series 
representing the functions in Yo. 


PROPERTY 4. o(z,a,p) =0 on the semi-azis x = Rez > 0 if and only if 
pmin(1,a) ts an integer. 
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The sufficiency of this condition is obvious. The necessity becomes clear 
if we write out the first term of the asymptotic expression for the function 
a(z, œ, p) as x — œ in the case a < 1 and as x — 0 in the case a > i. 


PROPERTY 5. Suppose that a > 0, 0 < P-S Ioa, and g > 0. Then 
a(z, œ, p) > 0 if and only if 0 < p < min(1, 1/a). 


This property can be established as a consequence of the general duality 
relation (2.11.7), the positivity of the function o(z, Q,P) = go(z, a, p) in the 
case when 0 < a < 1 and 0 < p < 1, and the periodicity of o(z,a,p) with 
respect to the parameter p. 

Figure 4a should help familiarize the reader with these properties; the do- 
mains corresponding to various properties are indicated there in the half-plane 
of values of the parameters (a, p). The numbers with asterisks correspond to 
the domains obtained in passing from a > 1 to a* = 1 /a in the duality rela- 
tion (2.11.7). Here one must bear in mind that the mapping of the domains 
takes place in the form of a curved reflection with respect to the line a = 1, 
i.e., the point (2,5) passes into (5,1), the curve a > 1, p= 1/a passes into 
the segment 0 < a < 1, p = 1, and so on. 

The domains marked by the numbers 1 and 1* are doubled because of the 
periodicity property of the function o(z,a,p). However, while in the case 
a > 1 we have the period T, = 2 and the domain 1 is simply shifted, in the 
case 0 < a < 1 the period Ta is 2/a and the duplicates of the domain 1* are 
distorted. 

We should note especially the equality (2.11.6). It shows that the values of 
p symmetric with respect to the half-line a > 1, p = 3 correspond, as it were, 
to mirror reflections of the functions ø. At the same time, the functions o with 
values 0 < a < 1 do not have this property. This may seem strange, since the 
densities gc(z,a,) with values 0 < œ < 1 have the following property (see 
(2.1.3)): 


go(z, a, p) = g(x, a, 0) 
= 9(—2, a, =f) = gc(—z, <M E p). 


The disagreement between the properties of the functions gc and ø is 
explained by the fact that ø is a branch of a single analytic function, 
while gc(z, a, p) is a patching together of the halves gc(z,a,p), x > 0, and 
9c(z, a, p) = gco(—z,a,1—p), z < 0, of different analytic functions. The du- 
ality relation (2.11.7) connects the halves (on the semi-axes z > 0 and z < 0) 
of one and the same analytic function o(z, a, p), a > 1, with different analytic 
functions ø having one and the same value of the parameter a’ = 1/a, but 
different values of the second parameter p’. 
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PROPERTY 6. Suppose that a > 1 and |p| < 1,1.e., we consider points 
(a,p) in a strip where the periodicity with respect to p does not yet manifest 
itself. The Fourier transform a(t, a, p) of the function o(z,a,p) exists if and 
only uf |p + 5 | < (2a)~!, which corresponds to the domain bounded by the 
curves p = (1 + 1/a)/2 and p = (1—1/a)/2 and the domain symmetric to it 
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with respect to the a-azis. In the first domain 
log a(t, a, p) = —|t|* exp{ima(4 — p) sgn t}. (2.11.8) 


In particular, ô(t, a, 4) = exp(—|t|*), a > 1. The function o corresponding 
to the parameters (a, 4) are symmetric. 

For the points (a, p) on the curves p = (1+ 1/a)/2 the Fourier transforms 
o of the functions ø have, respectively, the forms 


a(t, a, p) = exp(+i|t|* sent). (2.11.9) 


The existence of the Fourier integral ô depends directly on the behavior of 
g as T > oo and as x => —oo. The bounds p = $(1+1/a) and p = —4(1+1/a) 
distinguish in the half-plane a > 1 a domain in whose interior o(z, a, p) > Oas 
x — oo. The proof of this fact is based on the use of integral representations 
of the type (2.2.8) and (2.2.10) for the functions ø. According to (2.11.6), 
a(z, œ, p) > 0 as x => —o interior to the domain bounded by a > 1, p= l 
and a > 1, p = (1 — 1/a)/2 and interior to the domain symmetric to it with 
respect to the a-axis. 

In only four cases do the densities of stable laws have explicit expressions 
in terms of elementary functions (we have in mind, of course, the main pa- 
rameters a and 3). They were listed in §2.2. In some cases the densities of 
stable laws can be expressed with the help of special functions (see §2.8). 

So far, among the trans-stable functions that are not densities of stable 
laws no examples analogous to the four mentioned above are known, with 
the exception of the obvious negative variants which appear due to Property 
2. However, it is sometimes possible to express the functions ø in terms of 
special functions. We already encountered one such example in the preceding 
section: the Airy integral A(r) = o(z, 3, 4). 

We indicate another case when the function ø can be written with the help 
of special functions: Fresnel integrals. These integrals were used for writing 
the densities g(x, 5,0) in §2.8 (formula (2.8.30)). 

By using the duality relation and (2.8.30) it is possible to write the function 
a(x, 2, +) as follows: 


1\ 3 (911 
o(2,2,7] =r (z 19? 9 
p ESA 


178 2. ANALYTIC PROPERTIES OF DISTRIBUTIONS IN 6 


We note that to the function o(z, 2, +), like o(z, 3, t), there corresponds a 
Fourier transform of the form (2.11.9): 


aitai) exp(—it? sgn t). (2.11.10) 


PROPERTY 7. In the case when a > 1 and |p| < 1 the function |o(z, a, p)| 
decreases as £ —> œo more rapidly than exp(—cz) for any c > 0 if and only 
if alp| = 1 (ie., the points (a,p) must le on the curves p = +1/a). The 
symmetry property (2.11.6) indicates that the function o(z,a,p) decreases this 
rapidly as x + —oo only if the point (a, p) lies on the curves |p| = 1 — 1/a. 


This implies the following facts (in the strip a > 1, |p| < 1): 

a) The function |o(z, a, p)| is decreasing more rapidly than an exponential 
both as z — oo and as z — —oo only in the cases when the points (a, p) 
lie on the intersection of the curves |p| = 1/a and |p| = 1 — 1/a, i.e., when 
a = 1, p= 0 or when a = 2, p = +4. The first point corresponds to the 
case o(z, 1,0) = 0, and the two other cases correspond to the density of the 
normal law o(z, 2,5) and its negative variant 


o(z,2,-4) = —o(z, 2, 5). 


b) Let s be a complex number in the half-plane Res > 0. The two-sided 
Laplace transform 


s(s,a,p) = | exp(s2)o(«, o, p) da 
exists if and only if 1 < a < 3 and |p| = 1/a. In these cases 
¢(s,a,+1/a) = + exp(s®). 


c) For any complex number s the one-sided Laplace transform of o(z, œ, p) 
exists if and only if |p| = 1/a. In these cases 


J exp(sz)o(z, a, +1/a) dx = +E1/a(8)/Q, 
0 
where E,(s) is the Mittag-Leffler function (see Theorem 2.10.2). 


PROPERTY 8. Suppose that a > 0 and |p| < Ta/2. Consider the Mellin 
transform of the function xzo(z, a, p): 


oo 


R(s, a, p) af r°o(x, a, p) dz. 
0 


If0 <a < 1, then R(s,a,p) exists for values s in some neighborhood of 
the point s = 0 for any p. But if a > 1, then R(s,a,p) exists for s in some 
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neighborhood of zero of and only if |p| < (1+1/a)/2. If R(s,a,p) exists, then 
it exists for all —1 < 8 < a, and 
sin(zps) P(1 — s/a) 
sin(7s) T(1—s) ` 
The first part of this assertion follows directly from the representation of 
o by the series (2.11.1). The second part can be proved like Property 7 with 
use of the integral expressions for ø. The positivity of o(x,a@,p) in the case 
a > 2,0 < p < 1/a enables us to introduce a distribution connected with ø. 
Since, according to (2.11.11), 


(e e) 
f glz apdr = p > 0, 
0 


the distribution can be given by the density 


fils, ap) = (2.11.11) 


p(x, a, p) = a(x, a, p)/p. 


This distribution is said to be trans-stable, and the random variables cor- 
responding to it are denoted by Z(a, p). 

The fact that the notation for trans-stable random variables is identical 
with that for the cutoffs Z(a, p) introduced in §3.1 is not accidental. 

It turns out that trans-stable random variables satisfy the same relations 
as the cutoffs (of course, with a corresponding broadening of the domain of 
variaticu of the parameters a and p). Naturally, this is connected with the 
fact that trans-stable random variables Z(a,p) have transforms R(s, œ, p) = 
EZ*(qa, p) of precisely the same form as the transforms M (s, a, p) = EZ: (a, p) 
for the cutoffs (see (3.0.2)). 

As in the case of cutoffs, the definition of trans-stable random variables cor- 
responding to the value p = 0 can be extended in a natural way by condition 
(3.1.4) in the case when a > 2. 

The following can serve as examples of relations between trans-stable ran- 
dom variables analogous to those to be considered in detail in §§3.2-3.4. 

If a* >1,a>1, and 0 < p < 1/a, then 


z(a*,1/a*)2/* (a, p) £ z(aa*, p/a*), 


z(a, 1/a)/z(a, p) = z(1,p) 


(recall that, according to the convention we have adopted, the random vari- 
ables written especially on the left-hand side of the equalities are assumed to 
be independent). 


CHAPTER 3 


Special Properties of Laws in the Class 20 


83.0. Introduction 


The analytic properties considered in the preceding chapter for the laws in 
the class G are based on explicit expressions for the characteristic functions 
of these laws. The fact that the characteristic functions are an analytic tool 
especially suited for working with sums of independent random variables was 
reflected in the material of Chapter 2. For example, this is very clear from 
§2.1, where the properties of stable laws are written in terms of sums of 
independent random variables. 

Mellin tiansforms and characteristic transforms of probability distributions 
serve as the analytic basis for the investigations carried out below. These 
transforms fulfill the same purpose in the multiplication scheme for indepen- 
dent random variables (the M-scheme) that characteristic functions have in 
the addition scheme. Therefore, a large part of the material in the present 
chapter involving various relations for distributions in 20 is presented in terms 
of products of independent random variables. 

As we saw in I.5, the form (C) is a natural parametrization system for 
strictly stable laws. Recall that each distribution G € W is uniquely identified 
by a triple (a, 6, A) of parameters varying within bounds given by 


O<a<2, |0|<0.=min(1,2/a—1), A>0 


(the case a = 1, |#| = 1 corresponds to a degenerate distribution at the point 
xr = 0), or by a triple (œ, p, à) of parameters, where p = (1+6)/2 varies within 
the bounds 

1 — min(1,1/a) < p < min(1, 1/a). (3.0.1) 


In the distributions Go(z, a, 6, à) the parameter A has to do with the choice 
of the scale on the z-axis. This is shown by the equality 


Go(z, a, 0, A) = Ge(rzdA7~/%, a, 6, 1), 
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which is valid for any x and any admissible values of the parameters a, 6, 
and À. 

If a £ 1, then Ac = Ag, and thus the condition Ac = 1 distinguished the 
subclass of standard distributions in 29. In the case a = 1 the choice Ac = 1 
does not correspond to a standard distribution. If |@| # 1, then 


Golet 1, 6,1) = Gaz(z,1,0,7, (2/4 + age AA}, 


so the set of distributions Go(z, 1,0, 1) standardized by the condition Ac = 1 
is a set of specially normalized Cauchy distributions whose mode biases do 
not go outside the interval (—1, 1). The case |6| = 1 is a limit case for this 
set and corresponds to the distribution Gc(z,1,0,1) = D(x — 0) degenerate 
at the point @. 

The set of distributions singled out in W by the condition Ac = 1 is 
denoted by 23,. The nature of the material making up the main content of 
this chapter permits us to consider only the distributions in W, without loss 
of generality. For brevity let Gc(z,a,0) = Go(z, a, 0,1). 

Further, the symbol Y(a,@), without a subscript C required, will stand for 
random variables with the distribution Go(z,a,0). Along with the parameter 
system (a,@) we use a slight variant of it—the system (a,p), where p = 
(1+ 0)/2. To avoid confusion it is well to keep in mind that the letter 0 
always denotes the parameter in the first system, and p the parameter in the 
second system. 

The results in §2.6 give the following expressions for the Mellin trans- 
form (normalized by its value at the point s = 0 under the assumption that 
this value is positive) and for the characteristic transform of distributions 
in Wy: 


mc(s,a,p) _ sinmps r(1-— s/a) 


M(s,a, p) = aa a” 
W (s, aœ, 0) = wo(s, œ, 0) 0 (3.0.2) 
1,4; B 0 wı(s,&,0) ; 
where 
welga, 0) = EY (e0 (sgn Y (a, 0))* 


_ cos $(k — 0s) P(1 — s/a) 


In both transforms s can take any values in the strip —1 < Res <a. 
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To avoid awkward formulas it makes sense to agree on some more notation. 
If X is a random variable, then for any complex number s we stipulate that* 


A” = |X" sen X. (3.0.4) 


with 0° = 0 for any s. If X is nonnegative with probability 1, then this 
generalized understanding of a power obviously coincides with the traditional 
one. Powers of random variables in the sense of (3.0.4) clearly have the basic 
properties of ordinary powers: 


(X) = X*, XiX = (X1X2)°, 
but there is also an essential difference, since 
X? = sen X, (—X)* = —X* 


for any s. 

Here it is appropriate to point out a peculiarity of equalities between ran- 
dom variables in the sense of “4”. We have already encountered them in §2.1 
and will make repeated use of them in this chapter. If X’+ X 5X" 4 X or 


X'XE X" x, (3.0.5) 
then it does not follow in general that 
x's x". (3.0.6) 


However, if the characteristic function fx(t) is nonzero for almost all t or, 
correspondingly, w,(t)x (the elements of the characteristic transform Wx(t) 
are so denoted; see §2.6) are nonzero almost everywhere, then this equality is 
valid. Of course, the converse is valid without any restrictions with regard to 
the distribution of X. 

The use of relations between random variables (as done in §2.1) in place 
of writing out relations between the corresponding distributions makes even 
more sense in the present chapter than in §2.1. The fact is that we shall have 
to deal repeatedly with products X = X; X> of independent random variables, 
and the distribution function F of the product X is expressed in terms of the 
distribution functions F, and F of the factors in a form considerably more 
cumbersome than, say, the convolution of F 1 and Fz. Even in the simplest 
case when F; and F> are continuous, 


oe) 0 
F(z) = J F, (z/y) dF(y) + / (1-Fy(2/y))dFy(y). (3.0.7) 


— 00 


*See the second part of Comment 3.1 at the end of the book. 
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In what follows we encounter only this case, so those wishing to carry over 
the relations between random variables to the language of the corresponding 
distributions should use this equality (*'").* 


§3.1. The concept of a cutoff of a random variable 


DEFINITION. A cutoff of a random variable X with continuous distribution 
function F(x) not entirely concentrated on the negative semi-axis is defined 
to be any positive random variable X with distribution function (%2) 


F(z) — F(0) 


F(z) = 1 — F(0) 


= P(X < 2|X > 0), g > 0. 


There are constructions enabling us to construct cutoffs X as functions of the 
random variable X. One such construction is well known: 


> A 7 “F(X)+c¢ _ _ F(0) 
X= FPO) = Fa ( EP i? where ¢ = 7 F0) < © 
But this construction is trivially nonunique. For example, setting 
T if g> 0, 
i da are _—F(z)/c) if <0, 


we can assert that the random variable X = S(X) is a cutoff of X if 0 < c < 
OO. 
Indeed, let x > 0. Then 


P(S(X) < x) = P(S(X) < 2, X = 0)+ P(S(X) < 2,X <0) 
sP <9, A 20) 
+ P(F—(1 — F(X)/c) < 2, F(X) < F(0)) 
= P(X < 2|X > 0)(1 — F(0)) 
+ P(c(1 — F(a)) < F(X) < F(0)). 
Since P(X < 2|X > 0) = F(x) and F(X) has a uniform distribution on (0, 1), 
it follows that 
P(S(X) < 2) = (1 — F(0))F(x) = F(0) — e(1 — F(z)) 
= (1 — F(0))F(z) + F(0)F(z) = F(z). 


Let us single out some useful properties of cutoffs. 
1. If a is a positive constant, then 


(aX) =aX. 


*Raised numbers in this form refer to the corresponding notes in the comments at the 
end of the book. 
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2. If X, É X3, then 
Rek (=X) 4(=X), 
and conversely (the last equality is not considered in the case when 
P(X, > 0) = = 1). 

3. Let Fı and F; be continuous distribution functions not entirely concen- 
trated on the negative semi-axis, let cg = Fy(0)/(1— F,(0)), and let u(F;, F2) 
be one of the three well-known distances: the Lévy metric, the uniform metric, 
or the total variation distance. Then 


u(Fy, Fp) < (l+c,)u (Fi, F2) + |c2 — cil. (3.1.1) 
From this it follows, in particular, that if Fo, Fi,... is a sequence of such 
distribution functions, then the convergence u(Fn, Fo) > 0 as n > œ implies 


ul Fr, Fo) 30. 

4. If the random variable X has a continuous asap inganeen’ distribution, 
i.e., F(—z) = c(1 — F(z)) for z > 0, where 0 < c < 00, then Ñ £ |X|. 

5, In the sets of cutoffs of independent random variables X 1 and X> there 
are cutoffs X; and Xə that are independent random variables (for example, 
A= S(Xı) and Ñ? = $(X2)). 

6. Let X be a cutoff of a random variable X having a finite Mellin transform 
R(s) = EX*I(X > 0) in some strip —c, < Res < co, where I is the indicator 
function of an event. Then 

R(s)/R(0) = EX”. (3.1.2) 

All the properties except perhaps property 3 are of an elementary nature 
and do not require special proofs. In property 3 the cases of the uniform 
metric and the total variation distance are also uncomplicated. Therefore, we 
give a proof of the inequality (3.1.1) only for the case when p is the Lévy 
metric. According to the definition of this metric, if € = (Fy, F>), then for 


any T ` 
Fi(z — £) — e < Fo(z) < Fi(x +€) +e, 


and it is impossible to decrease £ while preserving the inequalities for all z. 
Let us multiply the sides of the inequalities by (1 + cı) and subtract cı from 
them. Considering that F(x) = (1+ ck)Fk(£)— ck, £ > 0, we have that 


fi(z — e€) — (1 +c1)e < (1 + c1)F2(£) — c1 
= Fz(z) + (c2 — c1)(1 — F(z) 
< Fi(a+e) 4+ (Laje. 
Setting 6 = (1 + cı )e + |c2 — cı|, we see that for all z 
F(z — 6) -6 < Fo(z) < Êi(z +6) +6. 


Consequently, u(F;, Fy) < 6. 


186 3. SPECIAL PROPERTIES OF CLASS W LAWS 


We shall often have to deal with cutoffs of stably distributed random vari- 
ables in W1; therefore, we adopt a separate notation for them, setting 


Z(a,p) = Y(a,8). 


If a < 1,p= 0 or ifa = 1,p = 1, then the cutoff Y (a, 0) is not defined. In the 
first case the distribution of Y (œ, —1) is concentrated on the semi-axis £ < 0, 
and in the second case Y (1, 1) £ 1, i.e., its distribution is discrete. 

In these cases the definition of the concept of a cutoff should, of course, 
be extended in such a way as to preserve the property of continuity of the 
distributions of the cutoffs with respect to the parameters a and p in their 
whole domain of variation—a property enjoyed by the distributions of the 
random variables Y (a, 6). 

The case a = 1, p = 1 turns out to be simplest, since we get what we want 
by setting Z(1,1) = 1. Namely, 


Z(a,p)>1 asa—>1,p— 1. (3.1.3) 


In the case a < 1,p = 0 the situation is more complicated, and it turns out 
to be correct to set Z(a,0) equal not to zero (as suggests itself) but to the 
random variable 

Z(a,0) = Z(a,1)E/E, (3.1.4) 
where E is a random variable with the exponential distribution 
P(E > x) = exp(—2), 7 > 0. 


Recall that, according to our permanent condition, all the random vari- 
ables written separately on the right-hand side of (3.1.4) are assumed to be 
independent. 

In explaining why this extension of the definition of a cutoff is the natural 
one we are aided by the characteristic transform Wz/(s,a,p) of the cutoff 
Z(a, p), which, since Z(a, p) > 0 with probability 1, has the matrix form 


Wz(s, a, p) = be ney H 


where M (s, a, p) is the Mellin transform of the distribution of Z (a, p). 
If œ < 1, then p varies from 0 to 1, and for p > 0 
sin tps T (1 — s/a) 


M = EZ° = 
(s, a, p) = EZ*(a, p) en n 


If a is fixed and p — 0, then clearly 


M(s,a,p) > 


M(s,a,1) =T(1—s)P'(1+s)M(s, a, 1). (3.1.5) 


sin ms 
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Since (1 + s) (or T(1 — s)) is the Mellin transform of an exponential 
distribution, i.e., the random variable E (respectively, the Mellin transform 
of the distribution of the variable 1/E), (3.14) follows from (3.15). 

We now proceed to systematically study the properties of the distributions 
of the random variables Y(a,@) and Z (a, p) defined at the beginning of this 
chapter. 


§3.2. The random variables Y (a,0) and Z(a, p). 
Equivalence theorems 


We begin from the simplest relations. For any pairs (a@,0) of admissible 
values 
-Y (a, 0) ŻY (a, —6). (3.2.1) 
If (0| = 1, then 
Y (1,0) 40. (3.2.2) 
For any pairs (a@,6;) and (a, 02) of admissible values 
Y (a, 41) d Y (a, 02) 
Y(a,62) Y(a,61) 


In particular (3-3), for any -1<6<1 


(3.2.3) 


Y~1(1,0)£Y(1, 6). 


Recall (this was stipulated in §I.3) that, in all the equalities “ a» connecting 
functions of random variables in the sense that their distributions are equal, 
the random variables written separately on one side of the equality (such as, 
for example, Y (a, 01) and Y (a, 62) in (3.2.3)) are assumed to be independent. 

All these relations can be proved by a single universal method. The charac- 
teristic transforms of the distributions of the left and right sides of the equali- 
ties are computed and compared. Since the characteristic transforms, like the 
characteristic functions, uniquely determine the distributions connected with 
them, coincidence of the characteristic transforms means coincidence of the 
corresponding distributions. 

For instance, using (3.0.3), we find that 


(—1)* we (s, a, 0) = wk(s, a, —9), k= 0,1, 
which implies (3.2.1) (note that (3.2.1) is a particular case of Property 2.2). 
Further, if |0| = 1, then 


cos 5 (k — s0) 


— m = 6k k= 0.1 
cos 3 (k — s) ae 01, 


wz(s, 1,0) = 
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which is equivalent to (3.2.2). In the third case 
cos Z (k — s01) T (1 — s/a) 
cos3(k—s) T(1—s) 
cos Z (k + s02) T(1 + s/a) 
cosZ(k+s) T(1+s) 


wk(S, Q, 0i )wk(— 8, Q, 02) a 


Since 
cos 5 (k — s0) _ cos Z (k + 36) 


cos $ (k — s) ~— cos3(k+ s) 
for the values k = 0,1, we get after the appropriate change of signs and a 
permutation of the factors that 


we (s, a, 01) we(—s, @, 02) = wr(—8, a, 01 )we(s, a, 82), k=0,1, 


i.e., the distributions of the left and right sides of (3.2.3) coincide. A special 
case is obtained if we set a = 1 and 6; = 1 in (3.2.3) and use (3.2.2). 

The following relations are among the simplest relations: 

If0 <a <1, then Z(a,1) = Y(a, 1). 

For any admissible values of a 


Z(a,4)£|Y(a,0)]. (3.2.4) 
For any pairs of admissible values (a, p1) and (a, p2) 


Z(a, pı) Z(a, p2) 

= ——_, 3.2.5 
Z(a,p2) Zla pr) ene 
except for the case when one of the numbers pı or p2 is zero. In particular, if 
0<p< 1, then 


Z-1(1,p)£ Z(1, p). 


The first and second equalities are simple consequences of property 4 of 
cutoffs. The validity of (3.2.4) and (3.2.5) is established by computing and 
comparing the Mellin transforms of their left and right sides. The random 
variables X; = |Y (a,0)| and X2 = sgn Y (a, 0) are independent if and only if 
either 6 = 0 or 0 < a <1 and || = 1. 

In term of characteristic transforms this property is equivalent to the equal- 
ity Wx,x,(s) = Wx, (s)Wx,(s), which in our case is easily seen to reduce to 
the requirement that 


w1(s,a,0) = Owo(s, a, 0), 


which implies the indicated condition. 
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THEOREM 3.2.1. Let (1,61) and (a2,62) be pairs of admissible values 
of parameters, and let v be a number such that 


(1 + |01|)/2 < v < (ay (1+ |O2|)/2)~?. (3.2.6) 


Then 
Y (a ’ 6 )Y” (a2, 02) £ Y (az/v, 6 )Y” (ayy, 02). (3.2.7) 
PROOF. Let us compute the characteristic transform of the left-hand side 
of (3.2.7). Using (2.6.23) and (3.0.3), we have that (k= 0,1) 
cos 5(k — s01) r (1 — s/a) 
cos5(kK—s) T(1-—s) 
cos 3 (k — sv02) T (1 — sv/a2) 
cos (k -sv) T(1- sv) 


wk (s, &1, 01 )wg (sv, a2, 02) = 


Similarly, for the right-hand side 
cos 3 (k — 618) T(1 — sv/ag) 
cosž(k-s) IT(1—-s) 
cos 3 (k — svb2) T (1 — s/ay) 
cosF(k-—sv) T(1-sv) ` 

It is obvious that the expressions coincide. It remains to verify that the 
parameters in the right-hand side of (3.2.7) are in the domain of admissible 
values. We know that |6,;| < min(1,2/a; — 1), and therefore the conditions 


Q2 
Wk (s, 22 A) Wk (SV, AV, 02) = 


[01| < min(1, 2v/a2 — 1), |@2| < min(1,2/(a,v) — 1) 
needed for the right-hand side hold if and only if 
ði] < 2v/aq —1, }92| < 2/(ayv) —1. 
But these inequalities are equivalent to (3.2.6). 


THEOREM 3.2.2. Let (a1,6;) and (a2,02) be pairs of admissible values 
of the parameters, and let u be any real number such that 


|@1|/min(1,2/a2 — 1) < |u| < min(1,2/a; — 1)/|69I. (3.2.8) 
Then 
Y (a1, 01) Y“ (a2, 2) = ¥ (o1, 02u)Y” (a2, 01/4) 
£Y (a1, —b2u)Y" (az, -01 /p). (3.2.9) 
The proof of the first part of (3.2.9) is just like that of (3.2.7). The second 


part is a consequence of (3.2.1). 
We mention another relation following from (3.2.9) and (3.2.7). 
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COROLLARY. For any pairs (œ1,01) and (a2,2) of admissible values of 
the parameters and any number u > 0 such that 
}1|/u < min(1,2/(a1, 4) — 1), 
\O2|u < min(1,2u/a2 — 1), (3.2.10) 
the relation 
Y (a1, 01)¥" (a2, 02) =Y (a2/p, O24) ¥" (orm, 01/4) 
Y (a2/p, —92u)Y” (arp, —41/p) (3.2.11) 


la Ie 


is valid. 
The cutoffs Z (a, p) satisfy analogous relations. 


THEOREM 3.2.3. For any pairs (ay, p1) and (a2, p2) of admissible values 
of the parameters and any number v such that 
a (lor — 41+ 4) < v < (arlp2 — $| + 01/2), (3.2.12) 
the relation 
Zlo, p1)Z" (a2, p2) = Z(a2/v, p1)Z” (av, p2) (3.2.13) 
is valid. 

PROOF. Let us compute and compare the Mellin transforms of the dis- 
tributions of the left and right sides of (3.2.13). They coincide if and only if 
the theorem holds, provided, of course, that the parameters of the random 
variables on the right-hand side of (3.2.13) lie in the domain of admissible 


values. 
According to (3.0.1), 


sin mp8 T(1 — s/aı) 


M M = 
(s,&œ1, p1) (sv, &2, p2) pı sin ns T(1—s) 


d sin Tpovs T (1 — sv/az) 
posinnvs T(1-— sv) 
sin rpıs T (1 — sv/a2) 


Q2 
re ew - 
(s. p „p1 ) (sv, a1V, p2) pı sin 7s r(1 = s) 


sin tpgvs T (1 — s/a) 
p2sintvs T(1—sv) | 

It is obvious that these transforms are equal. By (3.0.1), the pairs (a1/v, p1) 
and (œv, p2) are in the domain of admissible values if and only if 


lor — $| < min(1,v/a2) — 5, 


Ip2 — $| < min(1,1/(aav)) = 3. 
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If we consider that, according to a condition of the theorem, 
lp2 — 5| < min(1,1/a;) — 4, 
lp2 — 5| < min(1, 1/a2) — 4, 


then it becomes apparent that the condition written for v is equivalent to 
(3.2.12). 


THEOREM 3.2.4. Let (a1,p1) and (a2, P2) be pairs of admissible values 
of the parameters, and let u be a real number such that 


1 — min(1,1/a1) < |u|p2 < min(1, 1/a;), 
1 — min(1,1/a2) < p;/|u| < min(1,1/a2). (3.2.14) 
Then 
Z(o1, p1)Z" (a2, p2) = Z(a1, plul) Z” (a2, p1/lul). (3.2.15) 


The proof is by the same method used to prove the preceding theorem. We 
note a corollary of (3.2.14) and (3.2.15). 


COROLLARY. For any pairs (a1,p1) and (a2, p2) of admissible values of 
the parameters and any u > 0 such that 


le1/u — 4| < min(1,1/(a,p)) — 5, 
Pou _ 3| < min(1, /a2) Bi 5, (3.2.16) 


the relation 


Z (a1, p1)Z" (a2, pr) = Z(a2/p1, pou) Z" (a4 p, p/p) (3.2.17) 
as valid. 


REMARK. Condition (3.2.16) can be replaced by explicit inequalities de- 
termining the intervals of variation. They are not given because of their 
unwieldiness. 


THEOREM 3.2.5. Let a > 1,p be a pair of admissible values of the pa- 
rameters. Then 
Z(a,p)£Z-*/*(1/a, ap). (3.2.18) 


PROOF. Let us compute the Mellin transforms of the distributions of both 
sides of (3.2.18). We have that 


’ 


_ sinmps ['(1 — s/a) 
aE er psinrs T(1-— s8) 
ae | _ SİNTPS r(1 +s) 

m (-2. Lap) ~ apsin(rs/a)T(1+ s/a)” 
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Consider the ratio of these transforms: 
_ asin(rs/a) P(1 — s/a)P(1 + s/a) 
sin ts T(l1—s)f(it+s) | 
Since ['(1 — s)I (s) = 77/sin ms, it follows that 
_ sin(ts/a@) sin7s 
~ sinms sin(ts/a) 
The Mellin transforms coincide, so the distributions corresponding to them 
coincide, i.e., (3.2.18) is valid. According to (3.0.1), the condition that the 
pair (1/a,ap) belong to the domain of admissible values amounts to the in- 
equalities 
1 — min(1,a) < ap < min(1, qa), 
which are obviously equivalent to the condition 
1 — min(1,1/a) < p < min(1,1/a). 
But this is the condition for (a, p) to belong to the domain of admissible values 
if it is taken into account that a > 1. 
REMARK 1. (3.2.18) is none other than the duality relation constituting 
Theorem 2.3.1. Thus, here we have a second proof of that theorem. 
REMARK 2. Choose p = 3 in (3.2.18), which corresponds to the value 
6 = 0. Considering (3.2.4), we get in the case a > 1 that 


IY (a, 0)|4 Z-*/°(1/a, a/2). (3.2.19) 

With a = 2, this gives us for a normally distributed random variable Y (2,0) 
with expectation 0 and variance 2 that 

Y (4,1) |Y (2,0) -2 N-?/2, (3.2.20) 


where N is a random variable with standard normal distribution. 
There is apparently no analogue of (3.2.18) for the random variables 
Y (œa, 0). However, we have the following relation. 


THEOREM 3.2.6. Let a > 1,0 be a pair of admissible values of the pa- 
rameters, and O < p < 1 any number.* Let v = (2p)~'. Then 
Z” (1, p/a)lY (a, 0)| È Z“ (1, p)|¥ (1/a, a0)". (3.2.21) 
PROOF. Let us compute the Mellin transforms of both sides of (2.2.21), 
recalling that the Mellin transform of the distribution of |Y (œ, 0)| coincides 
with woọo(s,&œ,0). For the left-hand side, 
p asin(nvps/a) cos 58s T(1 — s/a) 
M 15 puaa iri ai aa IE al aR b 
(su A woah sy Os) psinnvs cos$s T(1-s) 


*In this theorem p appears as an independent parameter not connected with @ by the 
previously adopted relation p = (1 + @)/2. 


§3.2. THE RANDOM VARIABLES Y(a,0) AND Z(a, p) 193 


On the other hand, for the right-hand side, 


_ sintvps cost@s T(1+s) 
~ psinmsv cos $ (s/a) T(1 + s/a) 


Using the known equalities 
r(s+1)=sT(s) and [(s)f'(1—s)=7/sinzs 


and the fact that v = (2p)~!, we find that the ratio of the Mellin transforms 
computed is equal to 


. JE Fig . . 
sin typs/&œ cos 54s cos $s/a sintvs sins 


sintvs cos>s cos5és sinavpssinms/a’ 


which, after substitution of the value v and the obvious cancellations, takes 
the form 
sin s/acosFsasin7s _ 
cos }ssin }ssin7s/a 
Since p/a < 1 and a|6| < a(1/a) = 1, the parameters determining all the 
random variables in (3.2.21) are in the domain of admissible values. 
We mention two particular cases of this theorem. 


iL Heo 1,¢=0, and p=1, then 
IY (1/a, 0)|72/* £ Z-1/2(1/a, 1/2) 
£ 71/2(1, 1/a)|Y (a, 0)| (3.2.22) 
2 71/21, 1/a)Z(a, 1/2). 
2. If a > 1,6 =1/a, and p= 1, then 
Y (1/a, 1)7/* £ Z-"/2(1/q, 1) 
£ 71/2(1, 1/a)|¥ (a, 1/a)|. (3.2.23) 
If now (3.2.18) holds, then, transforming the left-hand side of (3.2.23), we get 


that 
Z(a,1/a)£ Z'/?(1, 1/a)|¥ (a, 1/a)). (3.2.24) 


The “equivalence-type” relations considered above are not, of course, ex- 
hausted by the list given here. It can be extended both by combining the 
relations of this and the next section, and by finding relations of a new type. 
For example, among the given relations there were none that would include 
the following easily verified equalities. 

Let a,@ be a pair of admissible values of the parameters. Then 


Z'/2(1, |8|)|Y (a, 0)| Z(a, |8|). (3.2.25) 
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If in addition a < 1, then 


IY (1, 0)|?Z (a, 101) = Z?(1, |8|)¥ (a, 1). (3.2.26) 


83.3. The random variables Y(a,#) and Z(a, p). 
Multiplication and division theorems 


In this section the terminology “multiplication and division theorems” is 
used more or less conditionally, and its main purpose is to perform a classifi- 
cation in our collection of relations between random variables. 


THEOREM 3.3.1. Let (a,0) be a pair of admissible values of the param- 
eters, and let 0 <a’ < 1. Then (p = (1 + 0)/2) 


Y (a, 6)¥'/“(a’, 1) £¥(aa’, 0), (3.3.1) 
Za, p)Z"/*(a', 1) = Z(aa, p). (3.3.2) 

If1< a < 2, then 
Z(a’,1/a’)Z"/“ (a, p) = Z(aa’, p/a’) (3.3.3) 


for any pair (a, p) such that aa’ < 2 and p> a’ —1/a. 
COROLLARY 1. 1. Ifa <1 anda’ <1, then 
Y(a,1)Y¥!/*(a’,1)£Y(aa’, 1). (3.3.4) 
2. Ifa>1,a’ >1, and ad’ < 2, then 
Z(a’,1/a’)Z1/* (a, 1/a) £ Z(aa’,1/aa’). (3.3.5) 
COROLLARY 2. Let a= 4,0 =1, anda’ =1/2* in (3.3.1). Then 
Y (1/2, 1)¥2(1/2*, 1) £Y(1/2*+!, 1). 


By (3.2.20), which implies that Y (1/2, 1) £ (2N?)—!, where N is a random 
variable with standard normal distribution, this relation gives us (3.4.13). 
If a = 1,0 = 0, and a’ = 1/2* in (3.3.1), then the relation 


N-N-1Y(1/2*,1)£Y(1/2*,0) 


follows from the fact (see (3.3.21) and (3.3.17)) that N/N£Y (1,0). It then 
follows from (3.4.13) that 


21-2" N . N-1N-2N-4...N-2 ZY (1/2*, 0). 
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The last relation, like (3.4.13), was apparently the first relation of that kind 
known in probability theory. These relations were obtained in 1946 by Brown 
and Tukey [245]. 

PROOF OF THE THEOREM. Let us consider (3.3.1). The components of 
the characteristic transform of its left-hand side have the form 


$ ; _ cos F (k — s0) (1 — s/a)P'(1 — s/aa’) 
we(s, 4,8) (Z,a, 1) = a aE 


_ cos 5(k — s0) T(1 — s/aa’) 
cos (k-s) T(1-s) 
= wk(s, aa’, 0), k=01. 
Moreover, considering that a’ < 1, we have that 
|@| < min(1, 2a — 1) < min(1,2/aa’ — 1), 


i.e., the pair (aa’,@) belongs to the domain of admissible values. The relations 
(3.3.2) and (3.3.3) are proved similarly (3-4). 


THEOREM 3.3.2. For any pairs (a,6) and (1,0') of admissible values, 
Y (a, 0)¥°(1,0’) ŻY (a, 00’). (3.3.6) 


If (a,p) and (1,p') are pairs of admissible values such that pp’ > 1-1/a, 
then 


Z(a, p)Z°(1, p’) £ Z(a, pp’). (3.3.7) 

COROLLARY. For any pair (a,0) of admissible values, 
Y(a,0) ÉY (a, 0a)¥%(1,6/0q), (3.3.8) 
Z(a,p) = Z(a, pa)Z*(1, p/pa), (3.3.9) 


where 6, = min(1,2/a—1) and pa = min(1,1/qa). 
In particular, if a < 1, then 


Y¥(a,0)=Y(a,1)¥(1,6),  Z(a,p)£Z(a,1)Z(1,p), (3.3.10) 


1.€., there ts a separation of the dependence of random variables Y(a,0) and 
Z(a,p) on the parameters determining them. 


The proofs of (3.3.6) and (3.3.7) are carried out in the same way as the 
proof of the preceding theorem. 


THEOREM 3.3.3. Consider pairs (a1,6,) and (a2,62) of admissible val- 
ues of the parameters such that a, < 2ag. 
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Let a > 1, and define 0 = 020; and &’ = 62 if |01| < 1/az and 0 = a262 
and 0 = 0; if |02| < 2/a, —1/a2. Then 


Z (1 ~) ¥(a1,61) dyra (24,6) Y(1,0')Y Q ~) (3.3.11) 
az) Y (a2, 42) a2 a2 
1/a2 
Zil2 Ç ~) Fea £ iy (24.6) IY (1, 6’)). (3.3.12) 
Q? Y (œz, 02) a2 


Let ag < 1, and define 0 = a20, and 6’ = b2 in the case 101| < 2/a,—1/a2, 
and 0 = azz and 6’ = 0; in the case |02| < 2/aı — 1/a2. Then 


Y (ai, 41) d 1 
yiajilm tS (2.0) Y (1,0) Z1 (1, a2), 3.3.13 
( a2) (a5 Ba) m (1,0) (1, a2) ( ) 
Y (aı,0 1/az ; 
Fea a y (2.0) IY (1, 0')|Z1/2% (1, a2). (3.3.14) 


PROOF. The conditions on |6,| and |02| arise as necessary conditions for 
the values of the pair (a;/a2,4) of parameters to be admissible, which is 
equivalent to the requirement that |0| < min(1, 2a2 /a, — 1). The validity 
of (3.3.11)-(3.3.14) can be verified by the method already worked out. We 
confine ourselves to checking two of them: (3.3.11) and (3.3.14). In the first 
case we compute the characteristic transform of the left-hand side, consider- 
ing that for the random variable Z(1, 1 /a2) the functions wo and w1 of the 
characteristic transform coincide with M (s, a, p) (k = 0, 1): 


M (s l, +) wWk(S, 01, 91 )WE(—8, a2, 02) 
2 


_ ag sin ms/az cos 5(k — s01) T(1 — s/a) cos $(k — s02) P(1 + s/a2) 


sin tts cos (k-s) T(1—s) cos$(k—s) T(1+s) 


The right-hand side of (3.3.11) has characteristic transform of the form 


Wk (=. a ) we(s, 1, 6") we (s l, >) 
Q2 Q2 2 
_ cos $(k — s0/a2) T(1 — s/a) cos Z (k — 0's) cos Z (k — s/a2) 
~ cos %(k—s/a2) T(1—s/az) cos Z(k—s) cos5(k—s) 
Division of the first expression by the second gives us 
sin 7s/a2 cos 3(k — s01) cos 3 (k — 809 )P'(s/a2)P(1 — s/a2) 
sin 7s cos 3(k — s0/a2) cos 5 (k — 6’s)T'(s)[(1 — s) 


after some simplifying transformations and cancellations. Since ['(s)P'(1—s) = 
T/sin rs, the choice of 0 and 6’ implies that the last expression is equal to 1. 
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In proving (3.3.14) it should be recalled that it suffices to compute and 
compare only the functions wo when dealing with positive random variables. 
For the left-hand side, 

5) ’ 6 ~ Oy ’ 6 wn tee. oY =” cme ne 
wols, a1, 61)wo(—s, a2, 62) cos3s [(1—s) cos%s I[(1+s) 
For the right-hand side, 


wo (=. a) wo(s, 1, 0’) wo (= Laz) 
a2 a2 202 


_ cos 348/a2T(1—s/a,) cos 30's sin 3s 
cos 58/a2 T(1—s/az) cos$s agsin%s/a2 
The ratio of these expressions has the form 
cos 5418 cos 528 sin ms/a2 T'(s/a2)P'(1 — s/az) 
cos 54s/a2 cos 54's sin rs r(s(1-8s) ’ 
after transformations and cancellations. By the choice of parameters, a further 
transformation leads us to the conclusion that this ratio is 1. 


THEOREM 3.3.4. Let (ai, pı) and (a2, p2) be pairs of admissible parame- 
ters such that ay < 2az and 1/a2—1/a; < pı < 1/ag, or such that ay < 2a 
and 1/aq — 1/œı < p2 < 1/a,. With p = azpı and p! = pz in the first case 
and p = a2p2 and p' = pı in the second case, it follows that 


Z(a1,p1) d z1 (2 ) 
z Eza | 2 p) Z, p). 3.3.15 
Z (a2, p2) a2 p) 21,0) l ) 
COROLLARIES. 1. For any admissible values of the parameters a and a’ 
ACCME PTN 
— ~ => ; dki 
PG Ezeo, a) (3.3.16) 
2. Ifa > 1, then for any p < 1/a 
Z(a,1/a) a 
— ~ = Z(1,p). 3.s.l7 
3. Ifa <1, then for anyp< 1 
aah £73/2(1 pa) (3.3.18) 
Z(a, p) a 
4. Suppose that 1 < 2a < 2. Then for any admissible pairs (a, p) 
Z(2a,1/2a) a 1/ 
—; n = |Y (2, 0)|*/*Z(1, p (3.3.19) 
Tay =H 0)IM2Z(1,0) 
and for any admissible pairs (2a, p) 
AGH) á IY (2,0)|!/*Z(1, p). (3.3.20) 


Z(a,1/2a) — 
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The proof reduces to a check that the Mellin transforms of the distributions 
on the left and right sides of (3.3.15) coincide. 

In many cases the relations given above between Y(a,@) and Z(a,p) can 
facilitate and simplify the computation of the distributions of statistics formed 
by random variables with stable laws. We illustrate this by two examples. 

EXAMPLE 1. Consider independent random variables Yı £ Yə é Y3 = Y4 £ 
Yp(a, ß,Ņ, à), and form from them the statistic 
viat 
- Yz-Yı 
(statistics of this kind are used, for example, in the problem of recovering 
the type of a distribution). We compute the distribution of L. According to 
(2.1.6) and (2.1.1), 


L 


Yı = HEY, = Y,2Yp(a,0,0, 2X) 
4 (2)1/2¥p(a, 0,0, 1) = (2A)'/*¥c(a, 0, 1). 
Consequently, 
a Y(a,0) a Z(a, 1/2) 
~ Y(a,0)  Z(a,1/2) ~’ 
where U is a random variable taking the values 1 and —1 with probability 5: 


Since the distribution function F,(z) of L is symmetric, it is connected with 
the distribution function F',(z) of a cutoff L by the equality 


(3.3.21) 


Fr(z)=2F,(z)-1, 2>0. (3.3.22) 
Therefore, to compute F(z) it suffices to compute F,(z). Since 
Z(a, 1/2)’ 


it follows from (3.3.15) that 
££71/2(1, a/2)Z(1, 3). 


As is clear from the comment after Theorem 2.3.1, the distribution functions 
of the random variables Z!/°(1,a/2) and Z(1, 4) are, respectively, 
T“ + cos 22) 1 


—-(1- a) (3.3.23) 


Fz) = a E arctan ( 7 


and F(z). From this we find that 


fy, (2) = j ” F,(2/y) dFa(y) 


s 
sın za 
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and the density pz (zx) of L has the form Pr) 
dy 


pel) =5 f Filer) 


2°? | f2\" = y*—? sin Za 
= =) (=) + 1 ana Gon FAET = 2 dy. 
T4 Jo y y°* + 2y* cos Fa + 1 
EXAMPLE 2. Suppose that 1 < a < 2, and let néng er ty, be 


independent random variables distributed like Y (a, —1,7,à). Denote by Z; 
a cutoff of the random variable Y; — +A, and consider the statistic 


z 1/a 
n*Zo * ) 
= | =- . 3.0.24 

K (= l ) 
According to (2.1.2), the distribution of Z; does not depend on y. On the 


other hand, by the property 1 of cutoffs and the very structure of the statistic 
Q, its distribution does not depend on À, i.e. 


oå nZ- (a, 1/a) adi 
~ \Z-4(a,1/a) +-+ Z-*(a, 1/a) 
We compute the distribution of Q. It follows from the duality relation (3.2.18) 
that Z~°(a,1/a) = Y(1/a,1). Moreover, it follows from (2.1.4) that 


n-*(Y(1/a,1)+---+Y¥(1/a,1))£Y(1/a, 1). 


which, when the rule (3.3.16) is applied to the ratio in parentheses, gives us 
that 


Thus, 


Q£Z(1,1/a). (3.3.25) 
The distribution of the right-hand side of (3.3.25) can be obtained without 
difficulty from that of the random variable Z!/“(1,a/2) (see (3.2.20) ); 


Fois) = Z arctan (Fae) +1- z (3.3.26) 


The function F,(xz) has another representation in integral form which is 
connected with a number of interesting facts and to which we devote some 
attention below. The facts presented are taken from the two recent papers 
[248] and [249]. 

The second representation of F,(z) is 


1 Lee 1+|/r+y|*\ dy 
_1 Se) ee 3.27 
Falz) 5 tia s log (EEIN (3.3.27) 
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This formula can be proved with the help of the following simple but useful 
lemma. 


LEMMA 3.3.1. Suppose that F(x,y) and f(u,v) are the distribution func- 
tion and characteristic function, respectively, of a pair (U,V) of random vari- 
ables such that 

Elog?(1 + |U|) + Elog?(1+ |V|) < œ; (3.3.28) 


[fi u,v) one (3.3.29) 


where D: = {(u,v): |u| > €, ol > E}. Then 
F(0,0) = — 4 + 5(F(0, 00) + F(oo, 0)) 


-d PP lee) = 1-10) = Hv) + n . 


for any £ > 0, and 


PROOF. It is not hard to verify that 


F(0,0) = ; - g | fsenzar(e.y 
1 1 
-1 ff snndrceu)+4 [fsenzsenyd leu) 


By the equality 
1 f" a dase OU 
sgnz = =| (e7 -e **) —, 
im Jo u 
we get (3.3.30) from the last expression for F (0,0) after changing the order 
of integration (which is allowed by the conditions (3.3.28) and (3.3.29) of the 
lemma). 
COROLLARY. Let Yı and Yz be independent random variables with the 
same distribution as Y(a,0), 0 < a < 2. Consider the variables U = Yı — tY2 
and V = Yə. According to the lemma, 


F(z) = F(0,0) 
=5+5 ian exp(—|u|*)[exp(—|ux — v|*) — exp(—lur + v|*)] 


But this integral occurs in tables (see, for example, (2.4.1.28) in [251]). It can 
be expressed by means of a single integral, and leads us to (33.a) 
In addition to being connected in a simple way with the distribution F(a) 
n (3.3.22) of a cutoff, the distribution F(z) also has a simple connection 
with the distribution Ga(x) of the random variable 


A = min(|L],|L|~!) = min(L, L~). (3.3.31) 


du dv 
uv ` 
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Namely, 
Go(x) = 4(Fa(£) — $) =2Fu(z), O<2<1. 


The distribution of A is used (as in [248]) to estimate the parameter a. 

An interesting feature of the distribution F(x) (and also of the distribu- 
tions F,(z) and G(x) connected with it) is the possibility of extending the 
domain of variation of the parameter a beyond the right-hand bound a = 2. 
It turns out that Fa(x) remains a distribution function even when a > 2. 


LEMMA 3.3.2. The function F(x) defined by (3.3.27) fora > 0 ts a 
distribution function. 


PROOF. The fact that F,(x) is nondecreasing follows from the obvious 
fact that its derivative 


Fla) = ae f (O+ lel hv) -+ fle jn + yiey-) 


is nonnegative. Replacing the variable y by yz and letting x go to ov, we get 
that 

+y dy 

L—y| y 

The integral in the second term can be found in tables (see (2.6.14.24) in 
[251]). It is equal to $ (note the factor 1/7?). 


Since 
1 HE mec 1+y] dy 
Fy. (—oo) = = — = log | ——= | = 
as T — —oo, it has thus been proved that F,(—oo) = 0 and F,(co) = 1 for 
any a > 0. 
We note here a property of Fx in the case when a = 2n is an even number. 
Denote by R, a random variable taking the n values 


1 i T i 
Faldo] = aal log 


n-1l1 n-3 n—3 n-1 


n n n n 


each with probability 1/n. Then consider a random variable Ya(1,0,0,1) 
independent of R, and form with its help the new random variable 


Y4(1,0,0, 1) cos(t Rn /2) + sin(t Rn /2) £ Yo(1, Rn). (3.3.32) 


The last relation is a consequence of (2.3.5a). We present the promised 
property of Fa(x) as the following theorem, whose proof we omit, referring 
the interested reader to the original source [249]. 
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THEOREM 3.3.5. For any integer n > 1, 
Foal) = P( Yol Ra] < 2). (3.333) 


Thus, the distribution F(z) is a mixture of Cauchy distributions with 


linearly transformed argument (cf. (3.3.32)). Since Rn 4, R as n —> o, where 
R is a random variable uniformly distributed in the interval (—1, 1), the limit 
distribution 

Falta P(Yoll, R] < 2) 


exists and has density (see (2.3.5a)) 


i f du 1 l+r 
= — A SSF a) 
Poo(z) =f ihe? m2 og |= 


geil. 
Precisely the same relation holds between the density of the distribution 
F(z), 2 < a < oo, and the symmetric trans-stable functions a(x,a,0) as 
is observed in the case when 0 < a < 2. 


§3.4. Properties of extremal strictly stable distributions 


The class W, which was the main subject of the preceding section has a 
subclass uniting the distributions with parameters (a, ĝa). As a matter of fact, 
the main role among them is played by the distributions with a < 1, because 
0 = 1 corresponds to the distribution degenerate at the point x = 1, while 
the distributions with a > 1 (more precisely, the part of them concentrated 
on the semi-axis z > 0) can, by the duality property, be expressed in terms 
of the distributions in the first group. The random variables Y(a,1), 0 < 
a < 1, whose distributions form this group (and only they) are positive with 
probability 1; hence Z(a,1) = Y(a,1). The abbreviated notation Y (a) will 
be used below for the variables Y (a, 1). 


THEOREM 3.4.1. Suppose that wi,...,wWn are numbers such that 0 < 
wj <1. Then for any n > 2 


Y (On) ZY (wi )Y V (wo) V 1/022 (wg) + Yet m-1 (wy), (3.4.1) 
where Qn = Wiw Wn. 


REMARK. By relabeling the numbers w; in inverse order it is possible to 
give (3.3.23) the form 


¥ (an) ZY? (wy )¥ #24? (wo) -YHn (wn) (3.4.2) 


with the help of a simple transformation. 
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The proof of (3.4.1) is by induction. For n = 2, (3.4.1) coincides with 
(3.3.4). If it is true for a set of n — 1 variables, then, by (3.3.4), 


Y (On) ÉY (an-1wn) Y (an-1)Y 2-1 (wp). 


If we now replace Y (œn—1) by its expression according to (3.4.1), we get (3.4.1) 
for n random variables. 

Despite the fact that (3.4.1) and (3.4.2) are simple consequences of (3.3.4), 
they serve as the source of a number of interesting connections between the 
distributions in the subclass of stable laws under consideration. Since infinite 
products of positive random variables will be encountered below, we agree to 
say that these products converge if the series of logarithms of their factors 
converge with probability 1. 

The condition that the product EX iEX5--- converge for some real value 
s # 0 is for us a convenient necessary and sufficient condition for the con- 
vergence of an infinite product X;X2--- of independent and positive (with 
probability 1) random variables. This is not hard to establish if the question 
of convergence of a product of random variables is reduced to the question of 
convergence of a sum of random variables. 

As before, E denotes a random variable with exponential distribution 


P(E > x) =exp(—z), ge > 0, 


THEOREM 3.4.2. Suppose that W1,W2,... 18 a sequence of numbers such 
that 0 < w; < 1, anda =wyw2-:: is their product. If a> 0, then 


Y (a) É Y (1) ¥ 1/8 (wy) y 3/12 (yg) (3.4.3) 
Ye (a) É Y“ (wy) V 22 (wy) -- (3.4.4) 
If a= 0, then 
Y“! (w) Y (wp) --» £1/E; (3.4.5) 
in particular, for any 0 <w < 1 
YY (w)¥" (w)Y h (w)--- 44/8. (3.4.6) 


The proof of (3.4.3) and (3.4.4) reduces to a passage to the limit as n — oo 
in (3.4.1) and (3.4.2), respectively, with use of the Mellin transforms of both 
sides of the equalities and the criterion given above for convergence of infinite 
products of independent random variables. 

For brevity let Z(a) = Z(a,1/a), 1 < a < 2. The duality relation (3.2.18), 
namely, Y(1/a) = Z~°(a) for 1 < a < 2, makes it possible to get relations 
of the type (3.4.1)-(3.4.4) and (3.4.6) for the random variables Z(a). We 
present them below. 
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If w1,w2,... is a finite or infinite sequence of numbers such that w; > 1 
and a = wjw2::: < 2, then 


Z* (cx) £ ZY (wy) Z? (we) +++, (3.4.7) 


Zz) £ Z(w)Z"/ (wy) Z 1/2? (wg) <> (3.4.8) 


Application of the duality relation to (3.4.6) leads for any a, 1 < a < 2, to 
the relation 
Z(a)Z"/*(a)Z'/@ (a) ÉE. (3.4.9) 


It is easy to see that 
Z(2)Ż|Y(2,0)| and ¥(2,0)4V2N£UZ(2), 


where N is a (0, 1)-normally distributed random variable, and U isa snc 
variable independent of Z(2) and taking values +1 and —1 with probability 4 y 
each. These considerations together with (3.4.7) and (3.4.8) make it possible 
to get the relations 


a U 
we Zol (yy, )Zv1w2/2 (un). + 
J2 ( 1) ( 2) 
U 
£ FZ (un) ZV (wg) Z 2? wg). = 3.4.10 
5 Ben) 21% (2) 2028 (9) (3.4.10) 
for any decomposition of the number 2 into a finite or infinite product 2 = 
wiwa, Wi >. 
Similarly, using the duality relation again, we can claim that 
U 
ys y-}/2 Y—1/2w1 w y —1/2wiwe as eee 
YOY w) (w) 
a U pa E 
Say fw) Y yg) 34.11 
AB (w1) (w2) ( ) 
for any decomposition 4 z = WW? ' j <1 into a finite or infinite product. 


We give three more relations in which the random aan N appear in 
an interesting way. Namely, as is clear from (3.3.19), for 5 LaL 


Z(2a) a |V2N 
a) 


1/a 


(3.4.12) 


Further, if we choose wy = -:: = Wn = 3 in (3.4.1) and use the equality 


Y (5) £N-? /2, which follows from the duality relation, then we arrive at the 


representation of Y(1/2") as a product of powers of independent normally 
distributed random variables: 


Y (1/2")22!-2"N-2N-2* ...N-2", (3.4.13) 
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Finally, we get the third relation by taking the logarithm of (3.4.9) and setting 
o = 2: that 


log ES log 2 + log|N| + 27" log |N| + 2~* log (NI >, (3.4.14) 


However, the expansion of the random variable log E in a series of indepen- 
dent random variables is possible with the help of variables other than log |N]. 
This is shown by (3.4.6) if the logarithm of both sides is taken. Namely, 


log E £ - {alog Y (a) + a? log Y (a) + a? log Y(a)+---} (3.4.15) 


for any0<a< 1. 

Both finite and infinite expansions of random variables in products of in- 
dependent random variables can very simply be adapted for computing the 
distributions of, so to speak, “products of ratios”. With this purpose we first 
compute the distribution of the ratio Y(a)/Y (a). According to (3.3.18), for 
œ< 1 

Y(a)/Y(a)£Z1/4(1, a). (3.4.16) 
The distribution function F,(xr) of the random variable Z1/ “tesla, 0 < 
a@ < 2, has already appeared in the analysis of an example given in the last 
section. And we can use its expression (3.3.23) for computing the distribution 
function of the right-hand side of (3.4.16). After simple transformations we 
get that for any 0< a< 1 


Rar = P(Y (a)/Y (a) < z) = P(Z(1,a) < 2%) 
a zara) 1 


mimes l, (3.4.17) 
Sın TQ 


1 
= — arctan ( 
TO 2a 


We know (see (2.9.2)) that Y°(a)4 E-! as a — 0. The distribution 
R,(z%) thus does not have a proper limit distribution as a > 0, but, on the 
other hand, R,(z) does. It is not hard to compute that as a > 0 


Ro(z) > Ro(z) = P(E/E < z) =2/(1+72), x>0. 


We take two each of the complex of relations (3.4.1) and (3.4.2), with 
the random variables making up the left-hand (right-hand) sides mutually 
independent, and then we divide the corresponding parts of these relations 
one by the other. As a result, 


(heed) (Heady. (Z) a Klan) d guenta, an) 
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Analogous considerations can be written for finite or infinite products of 
powers of variables Z(w;)/Z(w;), based on (3.4.7), (3.4.8), and the fact (a 
consequence of (3.3.17)) that for a > 1 


Z(a)/Z(a) = Z(1,1/a). 


It is clearly possible to take the limit as n — oo in (3.4.18) and (3.4.19) if 
Qn — a > 0. However, if an — 0, then the limit distribution in (3.4.18) does 
not exist, though it exists in (3.4.19), and we get that 


(Fy (Fay å a (3.4.20) 


An analogous assertion follows from (3.4.9) for the random variables Z(a) 
with 1 < a < 2. Namely, 


The following relation stands rather alone. Denote by TL, v > 0, a random 
variable having T-distribution with parameter v, i.e., a density of the form 


1 ai 
= —— ” = > . 
pulz) ON exp(—2), x > 0 
THEOREM 3.4.3. For any integer n > 2 
d n 
1/Y (1/n)=V a ynl ayn UT (n-1)/n” : (3.4.22) 


COROLLARY. By the relation (3.3.1) with a = 1 and a’ = 1/n, (3.4.22) 
implies that a random variable with the parameters a = 1/n and \0| < 1 has 
the representation 


on d y,— n 
Y *(1/n,0)=Y (1, OP nT ajn Tinyn 
£ FIL OL snl 2/n aii Fir- n”. 


PROOF OF THE THEOREM. The Mellin transform of the distribution of 
1/Y(1/n) has, by (3.0.2), the form 


This fraction can be expanded in a product of ratios of I’-functions by using 
the so-called multiplication theorem for the functions T(x), £ > 0: 


1 n—-1 (2r)("-V/2 
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We have 
(1+ ns) _ prs (s + 1/n) F(s + 2/n) | Te+(n—1)/n) 
T(l+s) — ['(1/n) ['(2/n) [((n—1)/n) ` een) 


Since the ratio I'(s + v)/I'(v) is the Mellin transform of the distribution of 
the variable [',, a comparison of the Mellin transforms on the left and right 
sides of (3.4.23) gives us (3.4.22) (3-6). 

This concludes our getting acquainted with the “multiplicative” properties 
of strictly stable laws. Here something should be said about where these 
numerous and diverse relations between distributions (which for conciseness 
we have clothed as relations between random variables) might turn out to be 
useful. 

First of all, such relations can be a convenient tool in computing distribu- 
tions of various kinds of functions of independent random variables Y (a, 9) 
and Z(a,p). If we supplement the relations of a multiplicative nature in this 
chapter by the additive relations of §2.1, we obtain a kind of distinctive “al- 
gebra” in the set of independent random variables of a special form. This is 
especially noticeable if we single out a set of independent random variables 
having distributions in W, with the parameter value 0 = Ja, 0<a< 2. 

To illustrate, we show how (3.4.13) and (3.4.22) help to get expressions of 
a new form for the densities g(x, +,1) and g(z, ft 1) as integrals of positive 
functions. According to the indicated relations, 


YESH), YS alan, 


and hence 
1 fei ) ( 1 ) dy 
¢.-,1})= —,~,l ,= 1) ——, 3.4.24 
a( 4 ) J (7 2 # vy 2 2y/Y ( 
2 ae | 2> T y \ dy 
2 i+ o T Y\ ay 


The form of the densities g(x, 5,1) and p, (z) is known. The first was given 
back in §2.2 (it is also easy to compute with the help of Theorem 3.4.3, since 
¥-1(1) £47 / according to (3.4.22)), and the second is well known. After 
substitution of these densities in (3.4.24) and (3.4.25) and a simplifying change 
of variable, we find that for z > 0 


1 E nae) ET TE EE 
g T, y! =f exp į -77 (yt +y“)? dy, (3.4.26) 
0 


1 1 ay 3 anf 
bA al 2 d- dy. 3.4.27 
o (2,51) ae ; exp 3 may y~”)? dy ( ) 


208 3. SPECIAL PROPERTIES OF CLASS W LAWS 


The second sphere of application of the “multiplicative” properties of stable 
laws is the construction of various kinds of useful statistics with explicitly 
computable distributions. Two examples of such statistics were analyzed at 
the end of the preceding section. 

Finally, some of the relations turn out to be useful in stochastic modeling 
problems for generating sequences of random variables with stable distribu- 
tions. For instance, given a sequence of random numbers having the standard 
normal distribution, it is very easy with the help of (3.4.13) to construct 
sequences of random variables distributed like Y (1/2”). 


§3.5. M-infinite divisibility of the distributions 
of the variables Y(a,9) and Z(a, p) 


The classical theory of limit theorems for sums of independent random 
variables has been generalized in various directions. One such direction was 
a scheme with a generalized concept of sum: a certain operation with respect 
to which the set of possible values of the random elements being “added” 
forms a locally compact abelian group. By analogy with the addition scheme 
(the A-scheme), it is possible to introduce in this generalized scheme the 
concept of infinitely divisible laws (see [101]). The scheme for multiplying 
independent random variables (the M-scheme) is a direct generalization of 
the A-scheme. If in the former we single out the case when the random 
variables being multiplied are positive, then we obtain the A-scheme to within 
an isomorphism. The analogue of infinitely divisible distributions in the M- 
scheme can be defined as follows (this definition, used in [101], differs in form 
from the definition used in [64], but it singles out the very same class of 
distributions, which we agree to denote by M). 

DEFINITION. The distribution Fx of a random variable X belongs to M 
if there exists a sequence of integers 1 < nı < nz < --- such that for each 
number n; the random variable X can be represented as a product of nj 
independent and identically distributed random variables, i.e., 


X4£X1X2---Xn,, where Xp Xi. 


The distributions Fy € M can be described in terms of the correspond- 
ing characteristic transforms Wx(t). It turns out that Fx is an M-infinitely 
divisible distribution if and only if the functions w,;(t)x have the form 


wo(t)x = aofi(t)fo(t), wı(t)x = a1fi(t)/fe(t), 
where fı and fz are the characteristic functions of A-infinitely divisible laws, 
and 


fo(t) = exp 1 fe = 1) dtta(t)} 
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where H3 is a spectral function, and the real numbers ag and a, satisfy the 
conditions 


isasi iea (-2 f atz(x)) (3.5.1) 


In the class IN we single out the special subclass I’ by the condition that 
among the sequences nj < no < --: there is a sequence consisting of even 
numbers. A distribution F € M is in the subclass M’ under the additional 
(necessary and sufficient) condition that a; > 0. A feature of the subclass M’ 
is that the random variables X with distributions in M’ can be decomposed 
into a product of n > 1 independent and identically distributed factors for any 
number n, even or odd. At the same time, an X with distribution in MADY 
can only be decomposed into a product of an odd number of independent 
factors. 

In connection with the random variables Y (a, 0) and Z(a, p) of interest to 
us, the membership of their distributions in Mt has to do with the correspond- 
ing representation of the characteristic transform W(s,a,@) and the Mellin 
transform M(s,a,p). Namely, G(z,a,0) € M if and only if 


wo(s, œ, 8) = aop1(s)p2(8), 


(3.5.2) 
wı(s,a, 0) = a191(s)/p2(s), 


where 
pı(s) = exp fas + [te — 1-—ssinz) dHh(x)} 


male ep fe- 1) dHa(a)} , 


Hı and Hg are spectral functions, and ag and a, satisfy the requirements 
indicated above. Since wo(t)x = wı(t)x for positive random variables X, the 
condition 


M(8,a,p) = 91(s) (3.5.3) 


ensures that G(z, a, p) is in M. 
It is not hard to see that if the distribution of a nonnegative random variable 
belongs to M, then it must be in Mt’. 


THEOREM 3.5.1. For any admissible values of the parameters (a, p) the 
distribution of a random variable Z(a,p) belongs to W. 


PROOF. Let us show that M (s, œ, p) is representable in the form (3.5.2). 
We have that 


sintpsT(1—s/a)  T(1-—s)f(1+s)F'(1—-s/a) 
MOCO = pnns TAs) T PO CA 
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It is not hard to derive the following equalities from the theory of the functions 
T (for this we can use, for example, the integral representation of log T (z) (see 
8.341 in [27])): 

1. In the strip |Re s| < a/b, a> 0,6 > 0, 


log (eee) = fee — 1 — su)v(u, a,b) du, (3.5.5) 
where 
v(u, a,b) = exp(—|ula/b)(|u|(1 — exp(—|ul/b)))~*. 
2. In the half-plane Res < 1/b, b> 0, 


log I'(1 — bs) = —bCs +f (e° — 1 — su)v(u, 1, b) du, (3.5.6) 
0 


where C is Euler’s constant. 
With the help of these equalities log M(s,a,p) can be written in the fol- 
lowing integral form: 


log M(s,a, p) = (—1/a + 1)Cs + fe — 1 — su)H; (u) du, 
where 
H; (u) = v(u, 1,1) — v(u, 1, p) + $(1 + sgn u)(v(u, 1,1/a) — v(u, 1, 1)). 


Since the function v(u, a,b) is nondecreasing as b increases for fixed u and a, 
it follows that on the semi-axis u < 0 


H; (u) = v(u, 1,1) — v(u, 1, p) > 0, 
just as on the semi-axis u > 0 
H; (u) = v(u, 1, 1/a) — v(u, 1, p) 2 0. 
Consequently, H; (u) is the density of some measure. If we add the fact that 
1 
J u? H! (u) du < oo, 
= 


then the representation of M (s, a, p) in the form (3.5.3) becomes obvious. 

REMARK. For symmetric random variables Z(a, p) = UZ(a,p), where U 
is a random variable independent of Z(a, p) and taking the values +1 and —1 
with probability 4, the assertion of the theorem remains in force, because the 
characteristic transforms corresponding to them have the form 


wolt) = M (rt, a, p), w(t) =0. 
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THEOREM 3.5.2. The distribution of Y (a, 6) belongs to M if and only if 
either œ <1, ora > 1 and0 =0. These distributions are in IN’ uf and only 
of, in addition, 6 > 0. 


PROOF. Let us transform the functions w,(s,a,0) to the form (3.5.2) and 
determine when H; and H3 are nondecreasing. 

We begin with the case 6 = 0. Since here w; (s,a,6) = 0, it is necessary to 
transform only one function wo(s, a, 0), for nothing prevents us from taking 
y2(s) = 1 and a; = 0 at once. We have that 


1 T(1-sa) T(5- 5s)I(5 + $8) P(1 — sa) 
= cosZs T(1—s) | r2(4) r(1- s8) ` 


According to (3.5.5) and (3.5.6), we can write log wo(s, aœ, 0) in the form 


C(1-—1/a)s + ie — 1—su)H}(u) du, 
where 
H; (u) = v(u, 5,5) + ¿(1 + sgnu){v(u, 1, 1/a) — v(u,1, 1)}. 


Obviously, H; (u) = v(u, 5,4) > 0 on the semi-axis u < 0. After aneari 
cated transformations, H; (u) takes the form H! (u) = v(u, 1, 1/a) — v(u, 1, 5) 
on the semi-axis u > 0. This difference is nonnegative, because v(u, 1 , 5) is 
nondecreasing as b increases. Therefore, G(z,a,0) € M (more precisely, it 
belongs to Mt’, since ay = 0). 

Suppose now that 640. We have that 


cos (k — 6s) T(1 — s/a) 


—— =0 E 
PTET r(1- s8)’ oe 


wk(s, Q, 0) = ai 


Hence ao = 1,aı = 0, and 


p? = wow/0, p2 = 0wo/w. 
Consequently, 

Abie r(1-— s)r(1+ s8) (1 -8/a) 
Sa dn tae [2(1 — 8) 
sae E 5s)0(1— $s)P(1+ £s) 

2 33 2 

2(s) = 

N= TG 8) 


)T 
POE + EeP Es) + 4s) 
We begin with the function p3(s). According to (3.5.5) and (3.5.6), 


logipa(s) = J (e* — 1 — su) Hi (u) du, 
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H}(u) = $(v(u, 3, 3) — vlu, 5, 101/2) + v(u, 1, 101/2) — v(u, 1, 3)) 


Moreover, as follows from 3.412 in [27], 
exp(-2 | Halu) du) = log |0|. 
Finally, since H4(u) is an even function, 


log p2(8) = pe — 1)H}(u) du, 


where H2(u) satisfies (3.5.1). 
Analogous arguments involving the function log p(s) enable us to write it 
in the form 


log pı (s) = C(1 — 1/a)s + fe — 1 — su)H; (u) du, 
where 
Hi (u) = 3[v(u, 1,1) — v(u, 1, |9|) + (1 + sgn u)(v(u, 1, 1/a) — v(u, 1, 1))). 
It is obvious that H{(u) > 0 on the semi-axis u < 0. On the semi-axis u > 0 
H; (u) = 4[2v(u, 1, 1/a) — v(u, 1,1) — v (u, 1, |0])]. 
It is completely obvious that Hi (u) > 0 if a < 1. Since 
Hi (u) ~ —}[v(u, 1,1) + v(u, 1,181) 


as u — œ in the case when a > 1, the inequality Hį (u) > 0 is false for large 
u on the semi-axis u > 0, i.e., Hı is not a measure. It was mentioned above 
that a; = 0, and thus the condition that the distribution of Y (œ, 0) belong to 
M’ coincides with the condition that 8 > 0. 

REMARK 1. We mention two particular cases corresponding to the vari- 
ables Y (2,0) (the normal distribution with parameters (0, 2)) and Y(1,0) (the 
Cauchy distribution). For them w, = 0 and 


oo 


wo(s, 2,0) = exp (-5¢: +f (e °*“—1+ su) 
0 


du 
wo(s, 1,0) = exp (fer -1+ T F 


du 
2usinh u / ’ 
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REMARK 2. The random variables |Y (a, @)| and 
Y"(a, 6) = |Y (a, 6)|" sgn Y (a, 8) 


are M-infinitely divisible together with Y (a, @). 


§3.6. The logarithmic moments of Y(a,@) and Z(a, p) 


Although the random variables |Y (a, 0)| and Z(a,p) have finite moments 
only of order less than a, their logarithms have moments of all orders, and 
this sometimes (for example, in certain statistics problems) turns out to be 
a very valuable property. It happens that moments of logarithmic type have 
a relatively simple form of expression. We occupy ourselves below with the 
computation of the following two quantities of similar type (k = 0,1, and 
m= 1,2...) 

Ykn (a, 0) = E(log |Y (a, 0)|)” (sgn Y (a, 6))*, 
Zn(a, p) = E(log Z (a, p))”. 

It turns out that the logarithmic moments yn and z, are polynomials of 
degree n in the variable 1/a and polynomials of degree at most n + 1 in the 
variable @ (or p). Explicit expressions for them can be written with the help 
of the Bell polynomials Cn (u1,..., un) which were encountered in VAA 


We introduce the quantities q1,q2,... by setting qı = (1/a—1)C, where C 
is Euler’s constant, and 


qk = AkTĂ"|Bp|/k + (1/o* — 1)T (k)e(k) for k > 2, 


where the B, are the Bernoulli numbers, ¢(k) is the value at k of the Rie- 
mann ¢-function, and the numbers Ax, are defined for the different logarithmic 
moments as follows: 

Ak = (2* —1)(1—6*) for the moments yon, 

A, = 1-—06* for the moments yin, 

Ay = 2"(1 —p*) for the moments Zp. 


THEOREM 3.6.1. For any admissible values of the parameters (a,6) or 
(a, p) the following equalities are valid (n = 1,2,...) : 
Yon = Cr(q1, 92; tee + Sra} 
Yin = 9Cn(q1,92,---5n) (3.6.1) 
Zn oa Cn(q1,Q2, 2.4 Ün: 


PROOF. Since 
M(s, a, p) = EZ*(a, p) 
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and 


wr(s, a, 0) = ElY (a, 0)|° (sgn Y (a, 8))*, 

it follows that 

= (d/ds)"w,(s, a, 8)|s=0, 

oie (d/ds)"M(s, Q, p)\s=0 
are the coefficients in the series expansion of the functions w and M in powers 
of s. However, it is more convenient to expand these functions not directly 
but after first expanding their logarithms in series. We remark that in the 
case 0 = 0 we have that w1(s, a, 6) = 0, so that the computation of yin should 
be carried out under the additional condition that 6 # 0. We have that 

log wo(s, a,@) = log cos 56s — log cos Fs 
+ log I'(1 — s/a) — logI(1 — s). 

The function log cos z can be expanded in a power series just like log r(1-— rz) 
(about this see 1.518 and 8.342 in [27]): 


] = — ee: en 2 a 
og COS T 2 K(k + 1) r 


log (1 — z) = Cz + >. st) 


With the help of these formulas we get as a result that 
log wo(s, a, 0) = Da 


where the numbers g, were defined above. If we now use Faa di Bruno’s 
formula, we find that 


n 


o0 k OO 
S 8 
mlao) =e [> ai) =1+ ; Cr (41, 925+++5 In) 7T: 
ket ‘ n=1 ` 


which implies the first equality in (3.6.1). 

Considering wı(s, œ, 0) with 6 Æ 0, it is convenient to write it in the form 
sin 58, [(1 — s/a) 
ĝsin z9 T(1—s) 
and expand the function h(s) in a series of powers of s. Of course, all the 
series expansion coefficients for w; are proportional to 0. The expansion of 
h(s) is carried out just like that of wo, with, however, the difference that the 
formula (1.518 in [27]) 


w1(s,a,0) = = h(a) 


œ ok 
log (=) = 2 ak + 1) ey x 
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is used here. The same device can be used to expand M(s, a, p) in a power 
series in s83. | 

Since the quantities qn, n > 2, are polynomials of degree n with respect to 
the variables 1/a and 6 (or 1 /a and p), by returning to the explicit expression 
for the Bell polynomials (it was given in §2.5) it is not hard to establish that 
Cn(q1,---,Qn) has the same property. In the case of the logarithmic moment 
Yin the maximal power of 0 is equal to n + 1, because yin = UCs 

We mention a computational detail which can simplify the expression for 
the quantities qą in a number of cases. The values ¢(k) do not have explicit 
expressions for odd k, though they rapidly approach 1 as k increases. For 
even k the values ¢(k) can be expressed in terms of the Bernoulli numbers, 
which are rational fractions: 


(k) = 25-1 *|By|/P(k + 1). 


In conclusion we give expressions for the logarithmic moments when n = 1 
and n = 2, considering that C1(qi) = qı and C2(q1, 92) = q? + q2: 


Yo. = 21 = (1/a — 1)C, y11 = 9yo1, (3.6.2) 
yor = (1/a — 1)?C? + 1?(2/a? — 30? + 1)/12, (3.6.3) 
yi2 = O[(1/a — 1)?C? + 2? (2/a? + 6? — 1)/12], (3.6.4) 

22 = (1/a —1)?C? + r? (1/a? — 29? + 1)/6. (3.6.5) 


CHAPTER 4 


Estimators of the Parameters of Stable Distributions 


84.0. Introduction 


It can be said without exaggeration that the problem of constructing statis- 
tical estimators of stable laws entered into mathematical statistics due to the 
work of Mandelbrot [118]-[128]. The economics models considered in those 
papers contained stable distributions whose parameters had to be determined 
empirically. Furthermore, it was discovered at once that mathematical statis- 
tics, while having at its disposal a large arsenal of methods, can be of little 
help in this case, since these methods are based mainly on such assumptions 
as the availability of an explicit form for the density, the existence of some 
particular number of moments, and so on, assumptions which are automati- 
cally not satisfied for distributions in G. In the best case they have only one 
moment of integral order (if a # 2), and only in a few cases are there explicit 
expressions for the density that would enable us to concretize the algorithms 
for estimating the parameters (say, by the maximum likelihood method). 


However, the problem had emerged, and the search began for solutions of it. 
This search was conducted in various directions and led to recommendations 
that satisfied the practical workers to various degrees. The investigations are 
still continuing, and thus far it is too early to speak of a complete solution of 
the problem if one has in mind the creation of algorithms meeting the require- 
ments of the contemporary theory, i.e., estimators that are unbiased, efficient, 
etc., at least in the asymptotic sense. In this limited space we shall not even 
acquaint the reader with the achievements presently at hand, although more 
than a few of them are very interesting. The purpose of this chapter is not 
to throw light on the state of the problem as a whole. Even a brief survey 
would require space that could simply not be justified by the direction of the 
present monograph. 


Below we propose a new approach to the problem of estimating the 
parameters of stable laws, based on the use of explicit expressions for the 
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corresponding characteristic transforms and the method of sample moments 
well known in statistics. The latter, as we shall see, gives rise to a certain 
closeness between the proposed method and the classical method of estimating 
the parameters of a normal law. (4+) 

To the reader interested in the contemporary state of the general problem 
we can recommend the fairly comprehensive survey [31], which encompasses 
the period up to 1973. A list of later investigations is in part III of the 
Bibliography. 

A feature of the proposed approach is that the problem of estimating 
parameters is solved for distributions in the class W, and then the general 
problem is reduced to the problem already solved by some special devices. 
The necessity of this division of the general problem is explained by the fact 
that explicit formulas for the characteristic transforms of stable laws exist 
only for distributions in the class W, and such formulas form the analytic 
basis of the method. 

In working with the stable laws of this class the most convenient form of 
expression for their characteristic functions turns out to be the form (E) in 
the Introduction. 

For convenience we reproduce here the expression for the logarithm of the 
characteristic functions g(t, v,9,7) in this form: 


log g(t, V, 0, T) 
= —exp{v7 !/? (log |t| +7 — i39 sgnt) + C(v7!? —1)}, (4.0.1 
2 


where C is the Euler constant, and the parameters vy, 6, and 7 vary within the 
bounds 


v> 1, |6\<min(1,2/v—1), |t| <0 


4? 
and are connected with the parameters a, 6, and À (of the form (C)) by the 
relations 


v=a*, 0e=9c, T= 1 log + C(4 — 1). 


Within the class 29 the parameters a,f,7, and A of the form (B) (we 
shall also have to deal with this form in what follows) are connected with the 
parameters v,@, and 7 by the relations 
r a= S a aeni fai, 


=e, ‘ 
2 arctan(27), tos i, 


— f Łlogà+C(4-1) fati, 
7 log À + 4 log(y? + 17/4) fo =I, 
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with inverse relations 


a= i/vyv, 
p = @max(1,1/(2Vv — 1)) sgn(1 — 1/v), 
_ {0 ify £1, 
a { z tan(70/2) ify =1, 
a | exp(r/ VT — C(1 — 1/VD) if v #1, 
exp(7 + (log cos(76/2) — log(7/2)) ify =1. 


A number of auxiliary facts will be needed in constructing estimators for 
the parameters of stable laws. These facts are in the next section, where a 
somewhat greater amount of material is collected than is used later. 


§4.1. Auxiliary facts 


To the form (£) there corresponds a distinctive form of expression for the 
characteristic transform W (s, v, 6, T), which is not hard to get from (2.6.26) 
by using the formulas for passing from the form (B) to (E). It is 

wk(s, V, 0,7) = exp{rs + Cs(1 — V/v)} 
£98 5(k — 80) (1 — sv) 
cos (k-s) I(1—s) ’ 
where k = 0 or 1 and s varies in the strip —1 < Res < 1/7. 
LEMMA 4.1.1. In the disk defined by |s| < min(1,1//v) the functions 


log wg (8, v,0,7) are analytic and are represented there by the following power 
series (k = 0,1): 


log wą (s,v,0,7) = klogð + Ts 


3 n n/2 8” 
+ Di laen(1 — 6 ) + ben(v"/? 1), (4.1.2) 


(4.1.1) 


where it is assumed that 0-log 0 = 0 for any values of 8 and that w1(s,v,0,r) = 
0 for any s,v, andr. The coefficients akn and bkn have fork =0 or 1 and 
n > 2 the form 


akn = (x? /u)(2r-*) a (1 T k))|Bnl, bin = r(n)e(n). 
Here ¢(n) is the value of the Riemann ¢-function at the point n. 
REMARK. For odd n > 3 we have that Akn = 0, since the numbers B,, are 
0 for such n. If n is even, then bpn = (27)"|B,|/(2n). 
The lemma is proved on the basis of the power series expansions of log cos z, 


log(x/sin z), and logI'(1 — x) which we have already encountered in proving 
Theorem 3.6.1. 
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For simplicity we agree to omit the subscript Æ on the random variable 
Yg(v,0,T) and the characteristics associated with it for the duration of this 
chapter. There is no danger of confusing this notation with that used earlier 
for the same random variable Y (a, 0, À) since the letters v and 7 are reserved 
for the form (E). 

We introduce the abbreviation notation 


U =sgnY(v,9,7), V = log |Y(v,0,7)|, 
U =U - EU, V =V-FEV. 
With this notation the functions w,(s,v,,7) are written as 
wz(s,V,9,7) = EU* exp(sV), kæmi. (4.1.3) 


By following the arguments in Theorem 3.6.1 it is not hard to get from 
(4.1.2) explicit expressions for the logarithmic moments of the random varj- 
ables Y (v,0,T), i.e., the mixed moments of the variables U and V. Indeed, 
since (as is clear from (4.0.1)) 

Yint r) =E Y (r00) 
= exp(r + C(1 — vv))Yg(a, 0) (4.1.4) 
for any admissible parameter values, the moments wkn = EU kV” turn out to 


be connected with the logarithmic moments (in which 1/a should be replaced 
by D) by the following equalities (k =O or 1 andn=0,1,...): 


wen = E(t + C(1 — Vv) + log [¥a(@, 6)|)” (sen Yp (a, 8))" 
n/n prei 
= 3 (rst Eli- ye *. (4.1.5) 


In particular, the moments wkn Up through order n = 2 have the following 
form: 


EU? = EV? = EU? =1, EU =90, EV =r, (4.1.6) 
EUV = 0r = EUEV, (4.1.7) 

EV? = 7? + r? (2v — 36? + 1)/12, (4.1.8) 
EUV? = 0(7? + T? (2v — 6? — 1)/12). (4.1.9) 


REMARK. We point out an interesting fact. It was mentioned in §3.2 
that the modulus |Yg(a,0)| and the sign sgn Yg(a, 0) of the random variable 
Yp(a,0) are independent if and only if either 9 = 0, or 0 < a< 1 and 
\0| = 1. Because of (4.1.4), this implies that IY (v,0,T)| and sgn Y(v,0,7) are 
independent random variables if (and only if) either 6 = 0, or v > 1 and 
ð| = 1. It is clear that the same conclusion is preserved also in regard to the 
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random variables V and U. As a supplement to this fact the equality (4.1.7) 
allows us to assert that U and V are always uncorrelated random variables. 

The mixed central moments pz, = EU*V” (k,n = 0,1,...) can be ex- 
pressed in terms of noncentral moments, and can thus also be written explic- 
itly. For any integer m > 0 


EU2"7" = Ey”, EU MAT" — EUV”, 


and so, since EU = 9, 


k 
lkn = E(U — 0)*V" = > (4) oey" 


= (-1)*EV" 2 Ee — (-1)*EUV" 3 (‘) ges 


where `o and >, denote summation over the respective even and odd values 
of 7 not exceeding k. Let Lon = Hon and Li, = Hın/0, with the latter 
understood as the limit of Li» as 0 — 0 in the case when 0 = 0. It is not hard 
to compute Lkn with the help of (4.1.2) and (4.1.3). According to (4.1.6), 
EV = 7; hence 


D Lin = exp(—sr — k log 0)EU* exp(sV) 
n=0 ` 


= exp(—sT = k log 6)wz(s, V, 0, T) 
= exp (Š Pkns” /n!), 
n=2 


where Pen = Akn(1 = 0") + bin (U"/? — 1). From this we get 
Lkn = Cr (0, Pk2, Pk3, +++ , Pkn), (4.1.10) 


where the C, are the Bell polynomials which appeared in §§2.5 and 3.6. Thus, 
we have arrived at the following assertion. 


LEMMA 4.1.2. For any admissible values of the parameters v,0, and T 
and any r,n=0,1,... 
Tarn r i r z 
=V EU V = Lin (= — Lin e 
(-1) aD (je tn 32 ( 


Let us see what some of the first moments Hkn look like. Explicit expres- 
sions for them will be needed in what follows. In (4.1.10) we have pz; = 0, so 
the first five Bell polynomials (see [70]) have the form 


Co=1, Ci=0, C2=pk2, C3=pk3, C4 = peat 3pk2- 
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This implies that 


Loo = Lio = 1, Loi = Lii = 9, 
2 2 
ae ee ee eee S 


Loz = zl 6 
n? 
— _ p2 aan fre 
be = (1 )+ = 1), 
Los = Lio = 2¢(3)(v3/? — 1), 
Tr’ 4 
Los = 799 (1 - 8") + 15 
a4 P at 4*4 fi R 2 
= = os eth = = 
m-a- Ee- +5 (Fa-#) 4-0) | 


mv? —1) + 3x4 (Fu _ 9?) + y-1)) 


which, in turn, leads to the following expressions for the mixed moments: 


0 1 


EU" =(-1) (= ("or ‘> (") gr-j+1 


m a 4 a 2 n4 
= = J _ 94 — 
EU V (1 — 6*) X C) + 100 8” 6* —7 
LEMMA 4.1.3. For any positive integers r and n, 


cov(0", V”) = (CD cov.) (Sor 


1 


= (-1)"(Lo, — L1,) > eee 


1 


(4.1.11) 


(4.1.12) 


(4.1.13) 


(4.1.14) 


(4.1.15) 


(4.1.16) 


PROOF. Using the fact that EV” = Lon and the equality (4.1.11), we have 


that ee ee ee 
coU V j= EU V -EU V 


= (—1)” (Lon — Lin) E (Ser j+, 
1 


In this equality we set r = 1 and obtain 


cov(U, V”) = —O(Lon i Lae); 


which implies the second part of (4.1.16). 
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COROLLARY. For any positive integers r and n the random variables U" 
and V” are uncorrelated uf and only if one of the following two conditions 
holds: 

1) @=0- 
2) Lon = Lin. 


The case 6 = 0 is of little interest, since, as we have seen, the stronger 
assertion that U” and V” are independent is true for it. 

The second condition involves nontrivial situations. The Bernoulli numbers 
Bn are 0 for all odd n > 3; consequently, pon = Pin = T(n)¢(n)(v"/? — 1). 
From this it follows that Lon = Lin, i.e., the random variables U’ and V” 
are uncorrelated for such n. In the case n = 1 this conclusion follows from 
the fact that Lo; = Li = 0. 

We consider a sample of size n > 2, i.e., a collection X karse Aa OF mde- 
pendent and identically distributed random variables. Denote by 


n 


= DX and S=- 
j=1 


the sample mean and sample variance. 


LEMMA 4.1.4. Assume that the sample random variables er eae 
Xn have finite fourth moment, and let a = = EXy, b2 = VarX, and c4 = 
E(X, —a)*. Then 

EAx =a, VarAx =b?/n, ES? = b, (4.1.17) 
Var S3 = (ct — bt) /n + 264/n(n — 1). (4.1.18) 

We do not give proofs of these equalities, which are known in mathematical 
statistics. Concerning this, see [43]. 

Consider a pair (L, M) of uncorrelated random variables having zero means 
and finite fourth moments. Let (Lı, Mı), ..., (Ln, Mn) be a collection of 
mutually independent pairs of random variables, each distributed like the 


pair (L, M). From the n-tuples (L;,..., Ln) and (M,,..., Mn) we construct 
the sample variances S? and S$?,. 


LEMMA 4.1.5. For anyn> 2 
cov(S?, S2,) = 1 cov(L?, M?). (4.1.19) 
PROOF. Using the last equality in (4.1.17), we have that 


cov( S2, SR) = E8282, — ESFES?, 
= ES?S?, — Var L Var M. (4.1.20) 
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It is easy to see that the sample variance can be written in the form 


8% = == (Axa - (Ax)’), 


where Ax2 is the sample mean of the squares of the random variables X;. 
Therefore, 


n 


ES si. = (.. — 


2 
:) E[AL: Am? — Ap2(Am)* — (AL) Ame + (Az)?(Am)’] 


2 
= ( 2 z) E(Iia — Ine — Iar + J22). (4.1.21) 


n — 


Letting the indices 7,7,r, and t vary independently from 1 to n, we can write 


Eli, = n? X E(L?M;M,) = n~’ $ E(L?M?) = n™'Eln, 

Elo, = n? Y E(L;iL;M?) = n~? X E(L} M?) = n`'Eln, 

Elon = n4 X E(L;L;M,M:) = n~* $_E(L?} M?) =n 7 EM. 
Consequently, 


ESS, = Eln =n? Y E(L?M?) 
= n~*[n(n — 1)JEL?EM? + nEL? M?] 
= Var L Var M + n` !(EL? M? — EL’EM?). 


Substitution of this expression into (4.1.21) gives us (4.1.19). 


$4.2. Estimators of parameters of distributions in the class W 


Let V1, ... , Vn be independent random variables distributed like the random 
variable Y = Y(v,6,7), i.e., with the distribution G(z,v,0,7), about which 
we know only that it belongs to the class W. Our problem is the statistical 
estimation of the parameters of G. 

From this sample we construct two collections of independent (within each 
collection) random variables U; = sgn Y1, ...,Un = sgn Yn and V; = log |Y1|, 

.,Vn = log|Yn|, which are distributed like U = sgnY(v,6,7) and V = 
log |Y (v,0,7)|, respectively. 

The idea for constructing estimators for the parameters v, 0, and 7 is based 
on the following three equalities: 


= Ê VarV - $ VarU +1, U= EU, 7 = EF. (4.2.1) 
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The last two were noted in (4.1.6), and the validity of the first is not hard to 
verify by computing the variances of U and V. By (4.1.6)-(4.1.8), 


Var U = EU? — (EUV)? = 1 — 6?, 
2_™ 
12 
The needed relation is obtained by eliminating 6? from these equalities. 
The idea itself is simple and not new in mathematical statistics. As a 


clarifying example we recall the classical problem of estimating the parameters 
of the normal distribution with density 


2 
ps) = = exp (-; (==) ) . (4.2.2) 


Let X1,..., Xn be independent random variables with the distribution of 
(4.2.2). Since a = EX, and o? = Var Xj, the fact that 


Var V = EV? — (EV) (2v — 36 + 1). 


2 


Ags i, we Sig (convergence in probability) 


as the sample size goes to infinity permits us to choose @ = Ax and 6? = 8} 
as estimators @ and ĝ? of the parameters a and o”. 

The method of sample moments is not very favored in mathematical statis- 
tics. It is regarded, not without reason, as an estimation method that is far 
from economical. However, in a number of cases when the distribution has 
sufficiently nice analytic properties (for example, the existence of moments 
of any order, etc.) the method of moments is capable of giving parameter 
estimators meeting contemporary requirements. And this category includes 
the case under consideration of the distributions of the random variables U 
and V, which have finite moments of any order. 

On the basis of the collections U;,...,U, and Vi,...,V, generated by the 


independent sample Yj,...,Y, we form the sample means Ay and Ay and 
take them as estimators of the parameters 0 and 7, i.e. 
6=Ay, ř= Áy. (4.2.3) 


LEMMA 4.2.1. The statistics 6 and 7 are unbiased consistent estimators 
of the parameters 0 and T with variances 


o? = Var 6 = (1 — 6?)n—}, (4.2.4) 
2 
o? = Var7 = z2 — 30? + 1)n7}, (4.2.5) 


PROOF. The fact that the estimators (4.2.3) are unbiased is a consequence 
of (4.2.1). The form of the variances is obtained from (4.1.17) and (4.1.6), 
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(4.1.8). The consistency of the estimators follows from the fact that the 
variances (4.2.4) and (4.2.5) of the estimators go to zero as n — Oo. 

Since the random variable U takes only the two values +1 and —1 with 
respective probabilities equal to p = (1+ 0)/2 and 1—p = (1- 9) /2, esti- 
mation of @ is equivalent to estimation of p, which is a well-known problem 
in statistics. It is known that p = (1 + 6)/2 is an efficient estimator for the 
parameter p (see, for example, [84]). 

Contrary to the case of the estimators (4.2.3), construction of an estimator 
for the parameter v has its complexities. It might seem that the statistic 


: 6 3 


could serve as such an estimator (consistent and unbiased), because, on the 
one hand, 
n 


S§ = An - (Ay)?) 5 EU? — (EU)? = Var U 


as n — oo, and, similarly, Sa £, Var V, which implies that pv. On the 
other hand, Ef = v by (4.1.18). 

However, taking Ô as an estimator of v is hindered by the fact that the 
domain Q = {(v,0): |@| < min(1,2/v — 1)} of variation of the parameters 
v and 0 does not coincide with the domain Q,, of variation of the values of 
the pair 0, 6, which has the form (see Figure 5) On = {(0,6): ð| < 1, D> 
—(1+1n)/2}, where nn = 3/(n — 1) for even n and nn = 3/n for odd n. But 
this means that pairs of values of Ô, ĝ can appear that do not correspond to 
any stable distributions. Consequently, we must alter the estimators v and 
§ in such a way that their new domain of variation coincides with Q. This 
can be done in different ways, for example, by drawing the normal from the 
point (0,6) to the boundary of Q (if, of course, the point is outside Q) and 
taking the coordinates of the points of the normal on the boundary of Q as 
new estimators. 

But we choose a simpler method when 6 does not vary in general, but only 
Ò varies. Namely, let 

> = max(0, (1 + |6|)?/4). (4.2.7) 


With this definition of Č the domain of variation of the values of the pair 
(Č,0) coincides with Q. 


LEMMA 4.2.2. For any n> 2 
o? = Vari = (22(v — 1)? + (9 — 56?)(v — 1) + 3(1 — 67) (3 + 0°) 5 
+ [2(v — 1)? + 6(1 — 6?)(v — 1) + 9(1 — 6?)?]/n(n — 1). (4.2.8) 


§4.2. CLASS W DISTRIBUTIONS 227 


FIGURE 5 


PROOF. According to (4.2.6), 


o? = Var (Ss? _ 5s) 


= : Var S% + = Var 2 = a cov(S¢, Si). 


Let us transform the right-hand side of this equality, using a property of 
the sample variance (4.1.18) and Lemma 4.1.5: 


| —_— 18 oe | 

2_ } v0 B Ha? O 2} _ 40 + 

f= [$e (Var V)*] + gEV (Var U)*] -z cov(U , V } = 
72 » 9 2 1 


We have available explicit expressions for the mixed central moments (Lemma 
4.1.2). With the help of (4.1.11), (4.1.13), and (4.1.15), along with the ex- 
pression for Var U and Var V we get after simple transformations that 


EU* = 1 + 20? — 394 = 4 Var U(1 — Var U) + (Var U}?, 
2 
EU’V? = Var U Var V + 7 Var DU(1 — Var DU), 


= 2 2 n 2, , 12 r? 5 
EV =(VarV)* + 2[(Var V) - 7g (Var) | + = (VarV — = Var U) 


2 4 
+ =?(Var V = “ VarU) + > Var U, 
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This tells us that the first three terms in the representation of ø% can be given 
the forms 


= 4 
S ey“ — (Var V)?] = Ww? +6 (vau = =) W +9Var U, 


= (EU — (Var U)?] = 9 Var U(1 — Var U), 
= cov(U’,V’) = 6 Var U(1 — Var U), 
where W = 6 Var V /r? — 3 Var U/2. Consequently, 


b 
n 


i = Pw? +6 (varu + =) W +3 VarU(4 — Var U) 


1 
2 2 
+ [2W* + 6W Var U + 9(Var U) ln -T 
To obtain (4.2.8) it remains for us to substitute the values of the variables 
W =v- 1 and VarU = 1 — 8°. 
LEMMA 4.2.3. 


Ep —v)? < E(v —v)? < 02 +04. (4.2.9) 
0 


PROOF. Since 
0| < min(1,2Vv — 1), 


it follows that v = max(v, (1 + |6|)?/4). Further, since 
max(a, b) — max(a’, b’) < max(|a — a’|, |b — b'|) 
for any real a,a’,b, and b’ and since \6| < 1 and |6| < 1, we have 
Ò —v = max(d, (1 + |6|)?/4) — max(v, (1 + |6))?/4) 
< max(|? — v|, |ð — 01). 
The inequality v — ù < max(|v — D|, |0 — 6|), is obtained quite similar, i.e. 
|ù — v| < max(| — v|, |ð — 4)). 


Hence, 


Consequently, E(i — v)? < E(f — v)? + E(6 — 0)?. 
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We can now formulate the following assertion, which is a consequence of 
Lemmas 4.2.2 and 4.2.3: 


THEOREM 4.2.1. The statistic Ù is an asymptotically unbiased and con- 
sistent estimator of the parameter v, and the square of the bias and the square 
deviation of Ù from the true value of v does not exceed the sum o2 +02, with 
order O(1/n) as n — œ. The exact values of the terms in this sum are given 
by (4.2.4) and (4.2.8). 


84.3. Estimators of parameters of distributions in the family G: 
the parameters a, 3, and À 


Consider a distribution G(z, æ, 8, y, à) € G and assume that we have an 
independent sample ¥j,..., Yen with this distribution. The idea for construct- 
ing estimators for the parameters a,3, and À amounts to transforming the 
sample in such a way that the result is another collection Yine Yo OF ran- 
dom variables with distribution in the class 20 and with defining parameters 
related in a one-to-one manner to the parameter a, 3, and à. The size m of 
the new collection will be essentially smaller than the original size; however, 
this must be regarded as a necessary price for the distributions of the random 
variables Y; to have the properties we need (membership in the class W). As 
a transformation of the original sample we choose 


Y? = Y3;-2 — €¥3;-1 — (1 — €)Y3;, j =1,2,...,2n, 


where 0 < € < 3 is a fixed number. 
The limit case € = 0 of this transformation is singled out in the transfor- 
mation 


Y,? = Y2;-1 — Yy; j= j 2g 15 Oth, 


that stands by itself (the size 6n of the original sample was taken so that 
for partition into thirds and halves we do not need to concern ourselves with 
remainders). 


LEMMA 4.3.1. The random variables Y? are distributed like Y(a,0,0, 2A), 
and the distribution of Y* coincides with the distribution of Y (a, 3*,*,A*), 
where the parameters 3*,*, and A* are connected with a, B,y, and À (af all 
these correspond to the form (A)) by the equalities 


B™=T)(€)B, A*=T2(E)A, y* = T3(£)2, (4.3.1) 
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where 


Ta(€) = 14 E +L- E)n, 
T,(€) = (1— €* — (1 — €)*)/Tal€) 
and A PT 
uf a : 
T3(6) = *(€log€+(1— €)logl-€)) fæ= 1. 


Both the random variables Y? and the random variables Y; have distribu- 
tions in W. 


PROOF. The statement concerning the form of the parameters 8*, A*, and 
+* reduces the use of Corollary 2.3 in this case. The fact that the distributions 
of yY? and Y* are in W is obvious, because for Y” we have that y* = 0 if 
œ #1, and 6* =Oifa=1. 

Although the distributions of Y? and Y* belong to W, construction of 
estimators for the parameters of these distributions by the method of §4.2 
is possible only after we pass from the form (A), to which the parameter 
expressions (4.3.1) correspond, to the form (E). 

Since the transition to the random variables Y* leads us to elimination of 
the parameter 7, the scheme of transformations of the original collection of 
parameters a, 3, and À to the corresponding collections of parameters of the 
other forms appears as follows: 


(a, B,-,A) a © (a, 8", 9", A*)a © (a, 8", 9", A") B 
+ (a,6*,A*)o > (v*,0*,7*)E. (4.3.2) 


All the transitions are invertible; therefore, after constructing estimators V, 6, 
and 7* of the parameters v*,6*, and 7*, we can return by the same path, 
bringing with us estimators a, B, and À of the parameters a, 3, and À. 

An analogous situation arises in the case of transition from the original 
sample to the collection {YP}. In this case the parameter ĝ is also elimi- 
nated in addition to y. Thus, only the pair of parameters a and À are to be 
estimated. The corresponding transition scheme has the form 


(a, Ce A)A E7 (a°, 0, 0, ATi T? (a, 0, \°) g 
+ (a°,0,r\°)o © (v®,0, 7°) p. (4.3.3) 


There is no trouble involved in finding the connection between the original 
parameters a, 3, and À in the form (A) and their equivalents v*,@*, and 7*, 
or between a, AÀ (also in the form (A)) and their equivalents v°, 7° if we use 
the transition formulas from one form to the other according to the indicated 
schemes. 
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We shall not give details of these simple but cumbersome computations 
here, but simply write out-the final result: 


2 
0” = 0" (à, 8) = = arctan (BT: (£) tan 5a) iai, 


9* — 6° (1, 8) = lim 6"(a, 8) = Ž arctan(8T3(€)), (4.3.4) 
T = T*(a, B, A) 
= {log + c4 — a) 


+ 5 log [T3(¢) + 5°72(¢)t3(€) tan? (Fa)|} fazı, 
=t LAA = lim 7" (a, 8, À) = log(2X) + 5 log[1 + B°T3(€)]. 


Let us note at once two features of these formulas. 

The first is that the sign of the asymmetry parameter changes in passing 
from @ to 6*. It is not hard to avoid this effect if desired by considering the 
collection of random variables —Y,* instead of the collection of Ye. 

The second consists in a narrowing of the domain of variation of the 
parameters v,6*, and r* in comparison with the domain of variation of the 
parameters in the form (£). This happens because in the case a #1 


max |Ti(¢)| = Ta) = [1 — 23-4 |/(1 4 21-9) < $, (4.3.5) 
while in the case a = 1 
log 2 
max |Ts(€)| = IBG) = == = 0.22... 


As a result, 6* varies not from — min(1,2\/v — 1) to min(1,2,/v — 1), but 
within narrower limits. We shall see that the change in the domain of variation 
of 6* introduces additional complications in the construction of estimators for 
the parameters v,@*, and r* and for the parameters &, 8, and À connected 
with them. It is in our interest to reduce the narrowing of the domain of 
variation of 6* to a minimum, which is equivalent to choosing a value of 
€ such that the functions |T(€)| and |T3()| attain a maximal value, i.e., 
f= 5: According to these considerations, a transformation of the first type 
(transition to the collection of random variables Y;*) with the value € = 5 is 
used in what follows. An expression for the corresponding parameters v, 6*, 
and 7* is obtained from (4.3.4) after substitution of the indicated value of É. 
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The inverse relations, i.e., those expressing the parameters a, 3, and A in 
terms of v,@*, and r*, are 


= 
= 
b = (v, 0*) 
_1+ exp[(1 — 1/,/v) log 2] tan ($0*/ Vv) fvl 
T—exp[(1—1/Vv)log2|tan (3/ Vv) | 
=I +) 7 tan (7 5" 
6 = B(1,6") = lim Blv, 0") = -pz tan (30°), 
E "e = cos(30*/vv) 
A= ME e e a 
r” 1 . 
xap -c(i-<)| if v= I, 
A = A(1,8*,7*) = lim X(v,0°,7*) = 5e cos (26°) (4.3.6) 
ga ’ ’ ~~ e 5) 5) — 9 9 . v0. 
For the second type of transformation (i.e., transition to the YP) 
P =v=a?, 6° =0, 7° = [log(2\) + C(1-— a)]/a (4.3.7) 


The relations inverse to them have the form 
a=1//v, à= } exp[|(7° + Cie? — Cy, (4.3.8) 
As is easy to see from the connection formulas (4.3.4), the domain Q* 


of variation of the parameters (v, 0*) is essentially smaller than the domain 
Q of variation of the parameters (v,#) which together with 7 determine the 
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distributions in 20 (the domain of variation of 7* remains the same as the 
domain of variation of 7: the whole real axis). Namely (Figure 6), 


Q" = {(v,6"): v > 5, |0*| < A(v)}, 


where 


-= 2 Aarctan | SPIC = 1/vt) log 2] - 1 m 
PO = Vime A ATI" (z) 


The value of H(t) at the point t = 1 is determined by continuity: 
H(1) = lim H(t) = z arctan (=?) . 
t—1 T 1 


We construct estimators 6* and 7* of the parameters ĝ* and r* (by (4.2.3)) 
and an estimator of the parameter v (according to (4.2.6) and (4.2.7)) 
from the transformed sample Y,",.. -, Yo, by the methods of the preceding 
section. The parameter 7* has the mele real axis as domain of values, and 
the estimator 7* of it has the same domain of values. But the estimators > 
and 6*, which vary in Q, have a larger domain of values than the domain Q" 
of elia of the parameters v and 6* being estimated by them. This means 
that the estimators č and 6* should be changed in such a way that their 
domain of values coincides with Q*. One possible variant of such a change is 
the following: 


D=v, Ö* =min(H(v),|6*|)sgn6*, 7 = 7*. 
Let us investigate the properties of the estimators D,0*, and 7*. First of 
all, the estimators 7 and 7*, which are not to be changed, fall entirely under 


Theorem 4.2.1 and the corresponding part of Lemma 4.2.1. Hence, only the 
properties of the estimator 6* must be analyzed. 


LEMMA 4.3.2. Asn => œ 


SO a 4E(D - v)? i ads 
< 02 + 1302. = O(1/n), (4.3.9) 
where o? and of. are computed by (4.2.8 8) and (4.2.4), with account taken of 


the fact that the sample size is 2n. 
PROOF. Since |0*| < H(v), it follows that 
6* — 9* = min(H (Ù), |6*|) sgn 6* — min(H(v), |6*|) sgn 0*. 


Elementary computations involving the various possible cases show that for 
any positive numbers a; and az and any real numbers b; and by 


| min(a1, |b; |) sgn bı — min(ag, |b2|) sgn b2| < max(|ay — a2], |b) — b21). 
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Therefore, 
E(6* — 6°)? < Emax{(H(i) — H(v))?, (6° — 6°)”} 
< E(H (i) — H(v))? + E(f — 6")?. (4.3.10) 


The function H(t) is monotonically increasing on the semi-axis t > 0, where 
it varies from 0 to l and is concave. Since H(t) ~ 2t/3 as t > 0, and H'(t) 
is monotonically decreasing as t increases, it follows that 


|H() — H(v)| < H'(+0)|o - v| = 4|? — vl. 
From this we get by using (4.2.9) that 
E(H(v) — H(v))? < 4E(b - v)? < §(07 + 06- ). 


This together with (4.3.10) yields (4.3.9). 
We can thus make the following assertion. 


THEOREM 4.3.1. The statistics D,0*, and 7* are consistent asymptoti- 
cally unbiased estimators for the parameters v,0*, and T*, with mean-square 
deviation from the estimated parameter and with magnitude of bias of order 


1/ Vi. 


Estimators of the original parameters a, 3, and À can be derived from 
(4.3.6) by replacing the parameters v,*, and 7* by their estimators, i.e., 


a={(P)"?, B=B(D,6"), A= A0, 0T"). 


Since the quantities a, 8, and \ depend smoothly on the parameters vy, 6", 
and 7*, i.e., all the functions in (4.3.6) are jointly differentiable with respect 
to their variables (only one-sided derivatives apply on the boundary of the 
domain Q* of variation of the parameters), it follows that the assertion of 
Theorem 4.3.1 extends also to the estimators @, 3, and À. However, the ex- 
plicit formulas (4.3.6) and (4.3.4) connecting them with the parameters a, 8, 
and À being estimated turn out to be very unwieldy, and this complicates the 
determination of concise estimates of the mean-square deviations and biases of 
the statistics @, 3, and À from the corresponding parameters a, 3, and A. We 
do not give these extensive computations, but refer the interested reader to 
[166]. Here we analyze a simpler case involving estimators of the parameters 


a and À on the basis of the transformed sample Y/,..., Y$,- 
Let the sample mean Ap and the sample variance S be constructed from 
the collection of variables Vjo = log|Y.°|, 7 = 1,...,3n, and define ¥ = 


693 /? — 4. We consider the statistics 


Yo = max(4, ĉo) and To = Ao (4.3.11) 
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as estimators of the parameters y? = v,7° connected with the parameters a 
and À by (4.3.7) and (4.3.8). 

The great advantage of the estimators Do and 79 over the analogous es- 
timators v and 7* is the simplicity of the relations between the parameters 
they estimate and the parameters œ and À. Asa result the estimators @ and 
A corresponding to them have the following simple form: 


=, A= Lexpl(io + Ci /? — C]. (4.3.12) 


How well the statistics & and \ play the roles of estimators for the parameters 
of the original distribution can be judged by the next assertion. 


THEOREM 4.3.2. For any pairs of admissible values of the parameters a 
and à (in the form (A)), the estimators & and À of them constructed according 
to (4.3.11) and (4.3.12) from the transformed sample Y?,...,Y2, satisfy the 
inequalities 


E(a-a)*< : 


(22v? + 10v + 1)— + 8(2v + 1)? 


f SL 
an orn > 


os 


1 
3n(3n — 1) 
(4.3.13) 


and 


E(A- A)? <2 VE) + 27(log 2A + C)? Pa(vo)| 


2 1 8 

2 = 3n/2 a 

x rnax(1,4 ) + exp (2 (3 z) c) A> | a, for 3n > = 
(4.3.14) 


where P2(v) = n4 (36v? + 20v + 19) /60, Pg(v) ts a polynomial of degree eight 
coinciding with the value of Log if we set 0 = 0 in it, and the value of A, is 
gwen by (4.3.20). 


PROOF. We have that 
& — a = i? —y-V? = (vy — 9)/( vig? 4 dqv-1/2), 


The quantities v and i are not less than 4; therefore, the right-hand side of 
the equality is at most 4|v — Ďo| in absolute value. At the same time, 


[Čo — v| = |max(4, vo) — max (1,v) | < [ĉo — vl. 


Taking into account (4.1.17), (4.1.18), and the fact that the sample size is 3n, 
we get that 


Ene ee > Var $2 


leya zal 24 2 1 
= -4 [E(Vo) (Var Vo) = F < (Var Vo) n(n — 1)’ ; 
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where V? É Vo and Vo = Vo — EVo. According to (4.1.15) and (4.1.13), 


4 4 
7.4” 2 2...” 2 
E[Vo)" = 340 S0” + 20v + 3), (Var Vo) ag 2¥ + ir; 
from which it follows that 
Loy — (Var Vo)? i= = a (2207 + 10v +1). 
T 


The estimate (4.3.13) is now obtained fine trouble from the above inequal- 
ities and equalities. 
Further, the obvious inequality 


je” — e¥| < |z — y| max(e”, e”), r,y E Rt, 
gives us that 


A-A ce |log À — log A| max(A, A) 


= |(řo — 10) 1/? + (log 2A + C) =} max(A, À) 


Do + VV 
< (79 — To] + | log 2A + C| |o — v|) max(2X, 2A). 


Moreover, it follows from (4.3.12) that 
max(2\, 2A) < max(1, 2A, exp(2řo + C)). 
By Hölder’s inequality, these inequalities imply the estimate 
E(\ — A)? < [E(|řo — Tol + | log 2A + C| |o — v|) are 
x [E max(1, (2A)*, (2)*)]?/? 
< 2[(E(7o — 70)*)!/? + (log 2A + C)?(E(Zo — v)*)*/7) 
x [max(1, 4A?) + (Eexp(4C + 879))'/?]. (4.3.15) 
Since |i — v| < | — v|, we have from Lemma 4.1.2 that 


4 
E(% = v)4 < E(v = y)* = = SES? - Var V 0)*. 


The sample size is 3n; hence, the last expression can be given the form 


EE 
Ef m ae oe e 
pr (== :) E zz 2 (Vo — Var Vo) - Paa 


3 rar 
4 4 3 i 3 p 
4-6 3n 1 72 Dees 
Se a aeeeicntecenl = 
<7 F (aSr vt) se (457) 


To estimate the last sum we use the following easily verified inequalities. 
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If X1,..., Xn, N > 2, are independent and identically distributed random 
variables with mean value a, then 


L2 ' 
E N (X;-a)} <4E(X, —a)*N-?, 
j=1 

8 
1 & 
z (š we - o) < 15E(X; - a)®N-?. 
Moreover, simple computations show that 


_9 __g 
E(Vo — Var Vo)* < EV) = L3, = Cg (0, poo,... , Pos) 


a. 
II 
j= 


(here the superscript 0 reminds us that we are dealing with the parameters 
vo = v and ĝo, and the function Eis depends only on them). 
Consequently, 


. 4-64 / 3n \4 E 
E(Čo —v)* < -8 (5) (4L0s + 15Ls) (3n) ~? 
26.34.19 / 3n \4 2 
= = ( ) Cg(0, po2,---,Pos)(3n)~?. 


3n—-1 
The polynomial Cg has a fairly complicated expression, but in our case 
(when po: = 0) it becomes essentially simpler and takes the form 


r8 


Cs = Pos + 28po6Po2 + 56pospo3 + 35p34 
+ 210po4p}2 + 280p53p02 + 105p%o, (4.3.17) 


where (since ĝo = 0) the quantities po, are functions of the single param- 
eter v: 


Po2 = T (2v +l}, Pos = 24¢(5)(v°/? — 1), 
6 
pos = 2¢(3)(v3/? — 1), pos = 355 (3213 + 31), 
4 3 
Pos = ilb? + 7), Pos = 349 (12814 + 127). 


Consequently, L8 = Pg(./v) is a polynomial of degree eight in the variable 
Vv. If desired, it can be written explicitly by using (4.3.17) and the explicit 
expressions for pox. Thus, 


4 
Eö — vy < EE (TM) ADG- (4.3.18) 


(note that 4-19-64/7® < 180). Further, 
3n 


ip -t =3-) Vo, 


j=1 
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which, by (4.1.15) and (4.3.16), implies the estimator 

E(žo — 10)* < 4E(Vo)*(3n)~? = Po(v)(3n)~’, (4.3.19) 
where 
-T 
-60 


To conclude the construction of the estimate (4.3.15) it is necessary to estimate 
the quantity 


P2(v) (36v? + 20v + 19). 


(Eexp(4C + 87))!/? = exp(2C)(E exp(870))'/?. 
We have that 


; 8 3n 
E exp(879) = E exp E S (Vo i J 


j=1 

8 3n 
= oo 0)8/3n 
= exp ( = 70) E Il |Y; | 

j=1 

8 3n 
a 0\8/3n 
= exp (- $70] I] E|Y; | ad 


j=1 


Here it should be recalled that in the case 8/3n < a the expectation E|Y,?|8/°” 
exists and that, by (2.6.26), it can be given the form 


E|Y°|8/3" = wo(8/3n, a, 0,0, 2) = (2A) 0 An, 


where 
4r\ T! 8 8 
Consequently, 
exp(2C)(E exp(879))!/? = exp f2 (2 log(2A) — 27) + c) | yb 


= exp (2 (3 -= z) c) us, (4.3.21) 


It is not hard to see that A3”/? ~ exp(4C(1/a—1)) as n — oo. Therefore, 
for large n, 


(Eexp(4C + 879))!/? ~ exp(2C). 


Inequality (4.3.14) is now obtained by combining the estimates (4.3.18)- 
(4.3.20) with inequality (4.3.15). 
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COROLLARY. The statistics & and À formed according to (4.3.11) and 
(4.3.12) are consistent and asymptotically unbiased estimators of the param- 
eters a and à, with the bias and the mean-square deviation of the estimated 


parameters of order O(1/,/n). 


REMARK. In this whole section the estimators of parameters of a stable 
distribution have been associated with an original sample size of 6n, n > 1. 
This assumption enabled us to form the transformed samples Fi sceri oq SO 
Y?,..-, YO without anything left over, and thus the corresponding satirnabone 
were AE ted with samples of sizes 2n and 3n. 

All the inequalities given in Theorems 4.3.1 and 4.3.2 remain true, of course, 
if instead of the sequences {2n} and {3n} in them we consider the sequence 
of integers n = 2,3,.... This corresponds to original samples of various sizes 
(3n in the first case, and 2n in the second). 


§4.4. Estimators of the parameter y 


The parameter y occupies a special position in the scheme of the above 
approach to solving the general problem of statistical estimation of parame- 
ters of stable laws. The device used to construct estimators of the parameters 
a,, and À amounted to transforming the original sample into a new sample 
of smaller size which was associated in turn with distributions in the class Ws, 
and within this class the logarithmic moments have comparatively simple ex- 
pressions in terms of the parameters being estimated. An analogous universal 
transformation of the original sample that would enable us to estimate the 
parameter ~y in the same way has not been found, and it seems that such a 
transformation does not exist in general. Therefore, new ways must be found 
to construct estimators of y. 

In the case a > 1 the distribution G4(z,a, 3,7, À) has finite mean value 
equal to Ày, and this makes it possible in principle to use the sample mean for 
an estimator of it. However, these distributions do not have finite variances 
for a < 2, only finite moments of order less than a. The scatter of the values 
of the sample mean with respect to the variable Ày being estimated thus turns 
out to be large, and it is all the layer, the closer a is to 1. 

The main idea for constructing an estimator for the parameter y from an 
independent sample Y;,..., Yn of sufficiently large size reduces to the follow- 
ing. Let a, 6,7, and À be the parameters of the stable law (in the form (A)) 
which is the distribution of the random variables Y;. We consider a simpli- 
fied variant of the problem, assuming that we know the parameters a, 3, and 
à, and that only the single parameter y must be estimated. In the general 
situation we must first estimate a, 3, and À, and then take into account the 
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effect of substituting estimators of these parameters for their exact values. A 
problem of this kind is significantly more complicated, and we do not treat the 
general case, recalling that the purpose of this chapter is rather to illustrate 
than to be independent. 

It is assumed below that Y/, Y3,... is a given sequence of random numbers 
distributed according to the stable law Ga4(z, a, 3,0,A) and that the values 
of the parameters a, 3, and A are known. Of course, the problem of gen- 
erating such a sequence of numbers can itself turn out not to be a simple 
matter, since, generally speaking, we have at our disposal only some fairly 
complicated integral representations of the function G 4(x, a, 3,0, A), or series 
representations of it. But these difficulties are of a computational rather than 
of a fundamental theoretic nature. 

The original sample Y;,..., Yn is transformed with the help of the random 
numbers Y;,..., Y,, into a new collection of variables Yi,..-,Yn by the rule 


H= h-E ns ce ee 


The variables Y; can be interpreted as an independent sample from a col- 
lection subordinate to the stable distribution. According to Properties 2.3b 
and 2.1, 


Ý, =~! ¥4(a, 0, 7/2, 24) £A7!¥a(a, 0,0, 2A) +4, 


from which it is clear that the Y; have a symmetric stable law biased by the 
quantity y. More precisely, 


F(x —y) = P(Y; < £) =Ga(A(x — >), 0,0,0, 2A). 


Consequently, y coincides with the median of the distribution F (x — 7), and 
this, in turn, allows us to use the sample median method well known in statis- 
tics for estimating + (see, for example, [84]). 

Let us arrange the observations Ý; in increasing order and denote the terms 
of the resulting sequence by w;: wı < w2 < +--+: < Wn. The sample median un 
is defined by 

_ f Win+1)/2 if n is odd 
Hn = i 5 (W(n+2)/2 + Wn/2) ifn is even. 


We take this statistic as an estimator of the unknown parameter 4, i.e., 
we let 7 = un. The cases of even and odd n do not differ in principle, and 
the corresponding asymptotic analysis (as n — oo) of the properties of the 
estimator y leads in both cases to the same conclusions, but in a technical 
respect the case of odd n is somewhat simpler, and for this reason we confine 
ourselves to an analysis of the case when n = 2m + 1. 
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Let Fp (x) = P{¥—7 < z}. It is not hard to compute (see §17 in [84]) that 


n 


Pia Y}. (POU - Fa (4.4.1) 


k=m+1 
Since the distribution function F(x) has a density 
p(x) = Agali a, 0,0, 2A), 


F,,(x) also has a density, and, as we see by differentiating (4.4.1), 


pala) = Fila) =n( Z) Raa - Paa) 
= an exp(-mu(2))B(2), (4.4.2) 


where (z) = — log F(x) — log(1 — F(z)) and a, = n{?), 
It follows from (4.4.2) that the bias E(Ẹ — y) and the mean-square error 
E(+ — +)? of the estimator ¥ can be written as the integrals 


EG <n) Sm / p(x) exp(—mv(zx)) dz, (4.4.3) 


E(j— 7)? = an | 2%9(2) exp(—mu(2)) dr. (4.4.4) 
The distribution F(z) is symmetric, i.e., 1 — F(z) = F(—z) and p(x) = p(—z) 
for all z. This implies, first, that the functions y(x) and p,(x) are even, and, 
second, that pp(rz) ~ const 2~%™+1)-1 as x — oo (by 2.4.48), (2.5.4), and 
(2.5.23)). Consequently, the integrals (4.4.3) and (4.4.4) exist if n > 4/a — 1 
(the first integral exists if n > 2/a — 1), and, moreover, the integral (4.4.3) is 
equal to zero, i.e., the estimator y is unbiased. 
Let us now turn to the determination of the asymptotic behavior of the 
integral (4.4.4). Since the distribution F(z) is symmetric, 


(0) =2log2, = y"(0) = 0, 


4.4.5 
p" (0) = 8p?(0) = 8293,(0, a, 0, 0, 2A). a 


The value of the density of a symmetric stable distribution at zero is known 
(see (2.2.11)). Therefore, by (2.1.2), 


p" (0) = 2/41 (1 + 1/a)(2A)!-1/2]? > 0. (4.4.6) 


Properties (4.4.5) and (4.4.6) enable us to use the Laplace method to obtain 
an asymptotic representation of the integral (4.4.4) as n — oo; we already 
encountered this method under analogous circumstances in §2.5 (see [20}). 
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According to this method, standard arguments give us that 
In = an | 225(x) exp(-mu(z)) dz 


=i exp(—mu(0)) | 2°52) exp (wog +- ) dr. 


Since 02 = my” (0) — œ as n — ov, we get the following asymptotic formula 
after the change of variable „z = t and corresponding estimates of the 
“tails” of the integral: 


Lo ane a enon) { fonie iida 
= (2m + 1) (E) 2" 8m) VIR (H(0))-* ~ent, 


where 
c = (1(2A)U-%)/¢ /T(1 + 1/a))?. (4.4.7) 


These arguments can now be summarized as the next statement. 


THEOREM 4.4.1. The statistic 7 = Un 18 a consistent unbiased estimator 
of the parameter y for all odd n > (2 — a)/a, with mean-square error 


E(¥-7)?~cen"1 asn > oo, 


where the constant c is given by (4.4.7). 


REMARK 1. For even n the statistic 7 = un is a consistent and asymptot- 
ically unbiased estimator of y. 

REMARK 2. We indicate one more variant of an estimator for ~y under the 
assumption that the values of the remaining parameters are known. Formally, 
this variant has more advantages than the one considered above, because 
it does not require a transformation of the original sample. The following 
considerations are taken as a basis for the estimator. According to (2.1.2), 


G(x, a, B, y, à) = G((z — I)A~"/%, a, B, 0, 1), 


where | = A(y + bo), and bo is uniquely determined by the parameters a, 8, 
and à. From this it can be concluded that the median m(a, 3, +, A) of the dis- 
tribution G(x, a, 3,7, A) is connected with the median m(a, 3) = m(a, G, 0,1) 
by the relation 


A~1m/(a, b, y, A) = amini a, B)+bo +7. (4.4.8) 


We then consider the sample median jf constructed from the original sam- 
ple Y;,..., Yn with distribution G(x, a, 8,7, A). Because of the nice analytic 
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properties of stable laws, the statistic u» turns out to have an asymptotically 
normal distribution with mean value u = m(a, 3, Y, À) and variance 


o? ~g*(u,a,8,y,A)(4n)-!, = n+ 00 
(see, for example, [84]). 


Consequently, replacing the median y in (4.4.8) by the sample median LL, 
we get the estimator of + 


J = ATA — bo — cA 1O71, (4.4.9) 


where c = m(a, ) is the unique solution of the equation G(z, a, 8) = 5: 
Like the estimator in Theorem 4.4.1, 7 is an asymptotically ETES and 
consistent (more precisely, 1/y/n-consistent estimator of the parameter y. 


§4.5. Discussion of the estimators 


The estimators given in §§4.2—4.4 for the parameters of stable laws are not 
best possible estimators even in the asymptotic sense as the sample size n 
increases to infinity. This is due, first of all, to the fact that all the estima- 
tors were constructed on the basis of the sample moments and the sample 
median—statistics of a simple structure that do not, as a rule, have the great- 
est efficiency. Also, in constructing estimators in the general situation we 
used a transformation of the original sample which led either to a reduction 
in its size by a factor of two or three, or to an increase of the scatter of the 
random variables making up the sample in isolated cases. At the same time 
the estimators we found have a number of merits that not only enable us to 
regard these estimators as a convenient tool for solving practical problems, 
but also make them a good basis for constructing best possible (in a definite 
sense) estimators of the corresponding parameters. 

If for the time being we agree to let ji stand for an estimator of the param- 
eter u (which can be any of the parameters considered in §§4.2-4.4), then in 
respect to its properties we can assume that it: 

1) has an algorithmically simple structure (this applies, first of all, to the 
estimators of the parameters a,6, and 7 corresponding to the form (£)); 

2) is asymptotically normal; and 

3) has mean-square deviation E(ji — u)? of order 1/n. 

Property 2) has not been proved for any of the estimators in §§4.2-4.4, 
but it can be established without difficulty with the help of the central limit 
theorem by using the “locally additive” structure of the estimators. By the 
obvious inequalities 


(Eğ — u)? <E(i—p)?, Var < 4E (fu — ys)’, 
P(Vīlä -u| > T) < nE(ñ- p)? /T?, T>0, 
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property 3) implies that jf is asymptotically unbiased with bias of order at 
most 1/,/n, that the variance of the estimator has order at most 1/n, and, 
finally, that ù is a 1/,/n-consistent estimator of the parameter p. 

We mention another circumstance that should be pointed out. The con- 
structions of estimators for the parameters of stable distributions were car- 
ried out within two groups. One included the parameters a, 8, and à, and 
the other the parameter y. These groups differed first of all in the form 
of the transformations of the original sample in constructing the estimators. 
The estimators of the first group can be regarded in total as the coordinates 
of a three-dimensional random vector which is obviously a 1/,/n-consistent 
estimator of the corresponding triple of parameters. 

Contemporary mathematical statistics has available several methods for 
improving 1/,/n-consistent estimators, and they make it possible in principle 
to construct asymptotically efficient estimators of the parameters of stable 
laws, at least within the indicated groups, on the basis of the estimators we 
obtained. These methods can be divided into the following two categories. 

The first contains methods that do not use information about the analytic 
expression for the distribution density of the sample units. Here we cite the 
comparatively recent paper [143], which contains, in particular, a brief survey 
of other publications in the same direction. 

The second category includes methods in which knowledge of the distri- 
bution density of the sample units is assumed. Among the large number of 
papers connected with the use of such methods we recall [18], whose results 
can form a convenient basis for solving the general problem of asymptotically 
efficient estimators for stable distributions. 

The preferability of the methods in the first category is obvious, especially 
if it is considered that the regularity conditions they use for the distributions 
of the sample units are weaker than the conditions usually present in the 
methods of the second category. Unfortunately, they have been considerably 
less developed than the latter methods and have been connected only with 
the problem of estimating the scalar shift parameter of a distribution, which 
allows us to use them only for estimating the parameter y and, in some cases, 
A. Therefore, in solving the problem of constructing asymptotically efficient 
estimators in the general situation we must resort to methods of the second 
category. Furthermore, it can be said at once that in constructing asymptot- 
ically efficient estimators of the first group we do not encounter fundamental 
difficulties on the indicated path. The difficulties begin if we want to construct 
similar estimators for the whole collection of parameters, since the problem 
of constructing a 1/,/n-consistent estimator for the parameter y has not been 
solved when the remaining parameters are unknown. 
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In the framework of this discussion we cannot acquaint the reader in detail 
with the substance of the general results in the paper [18] cited above; all 
the more so, we cannot verify that these results can be used in the problem 
of estimating the parameters of stable distributions. We give only an exposi- 
tion of the main idea of the method in the simplest situation when only one 
parameter u is to be estimated. 

In the role of u we can take any of the stable law parameters for which we 
have a 1/,/n-consistent estimator. At the same time, it should be mentioned 
that the possibilities of this method are much broader, and that it enables 
us to obtain asymptotically efficient estimators for the parameter collection 
u = (a, B, À), since, as we have seen, there are 1/ /n-consistent estimators for 
the vector-valued parameter p. 

Let p(z, u) be the density of the distribution of independent random vari- 
ables X;,..., Xn forming a sample from which we want to construct an asymp- 
totically efficient estimator of the parameter u, given that we have a 1/,/n- 
consistent estimator ji of this parameter. Let 


L(X | u) = Lose(x pE 


be the likciihood function, and let L'(X | u) and L”(X | u) be its first and 
second derivatives with respect to p. 
It turns out that the statistic 


Ô= ñ- L'(X | à)/L"(X | à) (4.5.1) 


is an asymptotically efficient estimator of u under certain regularity conditions 
for the density p(x, u), for example, the existence and continuity of the second 
derivative of p(x, u) with respect to u, etc. In the cases when the Fisher 
information 


I(un) = / (2 tog. D) ple.) da 


associated with the distribution corresponding to p(x, u) can be computed it 
is possible to use the statistic 


w= + L'(X | à)/(nI(u)) (4.5.2) 


instead of (4.5.1). 

The fact that an explicit expression for the density is used in forming the 
statistics ĝ and u* creates certain inconveniences in the case of estimating the 
parameters of stable laws, because with some exceptions we have available only 
complicated forms of expressions for the corresponding densities as series or 
integrals. However, these are not the difficulties that arise in constructing the 
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maximum likelihood estimator uo, where one must solve the transcendental 
equation L’(X | u) = 0 for the variable u (uo is the solution of this equation). 

Suppose, for example, that we are looking for the value of only the single 
parameter p of a stable law under the condition that we know the remaining 
parameters. To use the estimator (4.5.1) we obviously must at least have 


available some tables of values of the functions 
O ð? 
ðu logp(z,u) and ðu? log p(z, p). 


The tabulation of these functions is not a very complicated task. The attempt 
at compiling such tables for the stable distributions themselves can serve as 
an affirmation of this (see [6] and [31]). 

In constructing analogous estimators for several parameters (i.e., for es- 
timators of vectors) the problem becomes significantly more complicated on 
the computational level, since it is required to tabulate all the first and sec- 
ond mixed derivatives of the logarithm of the density with respect to the 
parameters to be estimated. 

As we know, among the stable laws there are those whose densities can 
a expressed with the help of elementary function. The parameter sets (a = 
ł, = 1,7,A), (a= 1,8 = 0,7, å), and (a = 2, 8,7, ) correspond to them. 
The last set, which corresponds to the normal distributions, is well enough 
known that there is no need to comment on the associated problem of estimat- 
ing parameters. The remaining two cases in this scheme are less known. It is 
thus useful, in our opinion, to analyze these cases by way of illustrating the 
considerations given above. We consider the simplest problem of estimating 
one of the parameters ~y or À under the condition that the value of the second 
is known. 

In the first case, 


a pe e7 exp EE 
glep DnA = gle- ep -y >A (453) 


Assume that the value À is known, while the value of y is unknown and must 


be estimated. We form the likelihood function L(Y | y). Its derivative with 
respect to 7 has the form 


3 
L(Y |) = PL tT py 
J 
The likelihood equation L’ fz | y) = 0, although it reduces to an algebraic 
equation, does not allows us to write out the maximum likelihood estimator. 
Therefore, we look for an asymptotically efficient estimator + of the param- 
eter y by following the general hints given above. First of all, the explicit 
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form (4.5.3) of the density enables us to compute the corresponding Fisher 
information J(). An uncomplicated computation shows that I(y) = 42/?. 
This permits us to construct an estimator 4 of the form (4.5.2). 

At our disposal are two 1/,/n-consistent estimators of the parameter y. 
One was given in Theorem 4.4.1, and the second in the remark accompanying 
this theorem. The most convenient estimator in this case turns out to be 
(4.4.9). Namely, 

ğ= Ati- eÀ, 
where ñ is the sample median and c = 1.08... is the solution of the equation 


G(z, 9,1) = 2[1 - 6(1/V2z)] = J, 


where ® is the distribution function of the standard normal law. 

In summary, with the help of (4.5.2) we obtain the statistic 

.. 5 
4=F+ gt Y | ¥)n7! (4.5.4) 

as an asymptotically efficient estimator of y. 

Suppose now that the value of y is known, but the value of \ must be esti- 
mated. The course of the arguments is as before. We construct the likelihood 
function L(Y | À) and then find its derivative with respect to A: 


= 3 e “ 
LY Ia sat + (34—a) 205 -a R -a 
J J 
Of course, we should not try to solve the likelihood equation L’(Y | A) = 0, 
and we must construct an asymptotically efficient estimator \ of À according 
to the indicated rule. However, contrary to the preceding case, the Fisher 
information J(\), though it is computed, turns out to depend on À. Therefore, 
À should be constructed by the rule (4.5.1). The needed 1/,/n-consistent 
estimator À of A can be found in §4.3 (equality (4.3.12)). Let us compute the 
second derivative of L(Y | A) with respect to A: 
L"(Y | A) = nà? - X (Y; — ya)7} 
F 
1 -2_ 1,22 -3 
+ 31089 — A) A — WA)? — 579 dM -07 


x 


According to the rule (4.5.1), the asymptotically efficient estimator \ has the 
form se : 

ASA = L(Y | d)/L'"(Y | A). 
As we see, the estimator of À has turned out to be more complicated than 
the estimator of y. This has to do in the first place with the choice of the 
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parametrization system. If instead of the system (a, 3,7, A) we had considered 
the system (a, 3,7’, A), where 7/ = 7A, then the complexity of the estimator 
of >’ in the case when a = 5 and 3 = 1 would have stayed the same, while 
the estimator of À given that y’ is known would have been essentially simpler 
to obtain, since the likelihood equation is fairly simple to solve. It turns out 
that the resulting maximum likelihood estimator is a sufficient statistic (see 
[166]). 

Asymptotically efficient estimators of one of the parameters + or A in the 
case when a = 1 and 3 = 0 can be constructed in the same way with the help 


of the explicit expression 
A 
g(x, LUS À) = a(x ~~ yA)? + T a 


for the density. For the parameter system (a, 8, Y’, À) mentioned above, the 
problem of constructing an asymptotically efficient estimator of +’ for known 
À can be found in [84], but there it is solved with the help of an obsolete 
iteration procedure. 

We conclude the discussion of the problem of estimating the parameters of 
stable laws by another remark. It is well known how important in practice 
it is to construct confidence intervals for parameters being estimated. In 
general, we have not treated this question, although the estimators given for 
the parameters v, 0, and 7, due to their simple structure, enable us to construct 
confidence intervals for them in a fairly simple manner. For example, the 
estimator for 0 is reduced to an estimator for the parameter p = (1 + @)/2 
in the binomial scheme (see the explanation after Lemma 4.2.1), and the 
problem of constructing confidence intervals for this estimator has become 
classical (see [84] or [43]). 


§4.6. Simulation of sequences of stable random variables 


In creating diverse methods for estimating the parameters of stable laws, 
theoreticians need empirical tests of the effectiveness of these methods. The 
problem thus arises of creating algorithms for constructing numerical se- 
quences that could be interpreted as sequences of observations of independent 
random variables with a common stable law. 

This problem has been considered by several authors in [40], [178], [246], 
and [250]. The complexity, which is based on the relation 


Y (a, B, 7,4) =G7!(w, a, B, y, A) 


where G~! is the function inverse to the distribution function G and w is a 
random variable uniformly distributed on (0, 1), turns out not to be very suit- 
able in the general case. The reason is that there are no analytic expressions 
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for the function G~!. The only exception here is the case when a = 1 and 
B = 0, for which 


Y (1,0, 7, A) ÉA tan[r(w — 1)] + Ay. (4.6.1) 


The case a@ = 2 is in a special position, of course, since the problem of 
simulating sequences of normally distributed random numbers has long had 
the attention of specialists, and considerable progress has been made toward 
its solution (see [252]). 

The multiplicative relations considered in §3.4 (the relations (3.4.13) and 
those in Theorem 3.4.3) help us to obtain a partial solution of the problem. 

Indeed, having the possibility of simulating sequences of normally dis- 
tributed random numbers, we can use (3.4.13) to generate sequences of ran- 
dom numbers having stable distributions with parameters a = 1/2", 8 = 
1, y = 0, and à = 1 for n = 1,2,..., and then with arbitrary parameters y 
and (due to Property 2.1). 

In precisely the same way, having created an algorithm for simulating se- 
quences of random numbers distributed like I’, jn (see Theorem 3.4.3), we 
have the possibility of constructing sequences of random numbers with dis- 
tribution G(x, 1/n, 3,0, 1) (and then also with distribution G(z, 1, /n, By, A)) 
for Te 2,3, vine 

The solution of the problem in the general situation was found by a path 
started in 1973 in the article [40], in which the particular case of the repre- 
sentation (2.2.18) with 0 < a < 1 and £ = 1 is used to obtain the relation 


Y (a, 1) (A(rw)/ E) -9/2 (4.6.2) 


where 
sin(1 — a)p /sinay \ 07% 
sin ay ( sin p ) i 
and the random variable Æ, which is independent of w, has the standard 
exponential distribution (see the paragraph before Theorem 3.4.2 in 83.4). 
The idea for obtaining (4.6.2) is based on the fact that the distribution 
function of the random variable (Y (a, 1))e/0-2), 0 < a < 1, is 


: J " exp(—2A(p)) dp. 


The general case was investigated in the same way in 1976 in [246], where 
the representations (2.2.27) and (2.2.28) of the distribution functions of stable 
laws were obtained as a basis for the author’s arguments. The following 
relations were found there: 

If a # 1, then 


A(p) = 


Y (a, 8) By(w — 1/2) E1- 1/2 (4.6.3) 
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where w and Æ, as in (4.6.2), are independent and have a uniform and an 
exponential distribution, respectively, 


x T /a 
sin Za(y + 0) co 1 
Ba eee Eee aa a oo. 
2 cos Z [p — alp + 0)] sin 5p le 27 
and 0 = BK (a)/a. 
If a = 1, then 
Y (1,8) = Bı (w — $) — 26blog E, (4.6.4) 
where w and E are the same as before, and 
2 if T 1 
Bilo) = tog (#22) + (1+ Bp)tande, lel <5: 
T cos 79 2 Z 


The relations (4.6.3) and (4.6.4) turned out to be not only very convenient 
for constructing sequences of random numbers, but also fairly reliable with 
regard to the quality of the random sequences generated with their help. This 
has been verified in the process of a comparative analysis of different methods 
for estimating the parameters of stable laws. 

In addition to what was said about the second way of solving the random 
number generation problem, we direct attention to the end of §2.2, where 
mention is made of the possibility of constructing representations that are 
analogous to (2.2.18) and (2.2.19) but at the same time differ essentially from 
them. It is not excluded that such variants of representations will enable us 
to obtain new relations of the type (4.6.3) and (4.6.4) and to exploit them in 
constructing sequences of random numbers (of course, the advantages of these 
variants look very problematical so far). 


Comments 


In our desire to lighten the burden on the main text as much as possible 
we have gathered here facts of a historical nature, along with facts which, in 
our view, are not of paramount importance; in particular, information about 
who is credited with various results and when they were first published. The 
absence in these Comments of an explanation and reference for some theorem 
means that the author does not have such a reference, and possibly that the 
corresponding statement is being published for the first time. 


Introduction 


I.1. Pólya introduced in probability theory a certain class of characteristic 
functions which has come to bear his name. This class is formed by the real- 
valued functions f(t) defined on the whole t-axis and having the following 
properties: 


O)=1, F(-th=f(t), F<, f"(t)>0, t>o. 


The functions f (a) , 0 < a < 1, are easily seen to be in the Pólya class. See 
[68] for details about the Pólya class. 


I.2. By definition, the class of infinitely divisible laws consists of the distri- 
butions G that can be represented as compositions of n identical distributions 
Gn for any integer n > 2. In 1934 Lévy obtained a description of the distri- 
butions G in this class in terms of the corresponding characteristic functions 
g(t). 

The so-called Lévy canonical form of expression for the functions g(t) is 
given by (1.3). 

Thus, Theorem A is a rephrasing of the Khintchine theorem asserting that 
the class 6 introduced coincides with the class of infinitely divisible distribu- 
tions. This result was first published by Khintchine in the paper Zur Theorie 
der unbeschrankt teilbaren Verteilungsgesetze, Mat. Sb. 2(44) (1937), 79-117. 
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In his investigations Khintchine employed another form of expression for 
the functions g(t), now called the Lévy-Khintchine canonical form. Theorem 
B, which was proved by Gnedenko, corresponds in the original paper to the 
Lévy-Khintchine canonical form, and not to the Lévy canonical form, with 
which the formulation we have given is associated. 


1.3. The variants of expressions in the literature for the functions g(t) have 

the form 
g(t) = exp(ity — Av(t, a, ĝ)). 

In the forms of expression we used for the function log g(t) the parameter 
+ is replaced by Ày, and as a result the function becomes proportional to A. 
It turns out that this form of expression has analytic advantages, though at 
the same time it excludes degenerate distributions from the description of the 
family G. It is perhaps surprising that until now no one has wanted to write 
the characteristic functions of stable laws and the characteristic functions 
g(t, À) of homogeneous stable processes X(A) in a coordinated way. The fact 
of the matter is that (see [77]) 


log g(t, 4) = Alog g(t, 1), 


where A > 0 signifies a time parameter. 

The form (A) of expression for log g(t), which is the best known form and 
until recently has been the one most often used, was proposed by Lévy in [51]. 
In the case a = 1 this formula was given independently of Lévy by Khintchine 
in [44]. In Lévy’s monograph [50] the form (A) was preceded by the following 
modification of it, obtained for strictly stable laws: 


log g(t) = AT (—a)(cos(7a/2) — iB sin(ma/2))t, 


where 0 < a < 2, |8| <1, A>0, and t > 0. 

The forms (B), (M), (C), and (E) apparently appeared for the first time 
in the author’s papers [96], [97], and [167]. The merits of the form (B) and 
(C) were pointed out long ago. For example, the form (C) can be encountered 
in Feller’s monograph [22], while systematic use is made of the form (B) in 
the monographs of Ibragimov and Linnik [35] and Lukacs [54]. 


1.4. The following equalities establish a connection between the constants 
Cı and C2 in the expression (1.10) for the spectral function H(z) of a stable 
law given in the form (A) by the parameters a, 8, y, and À: 


Cı = X(14+ b)n ~T (a) sin(ra/2), 
C2 = ÀA(1 — b)n T (a) sin(ra/2). 


I.5. Criteria 2 and 3 stem from work of Lévy. 
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I.6. If (1.27) is replaced by the equality 
C1X1 +c2X04+--- +o, X= Xs + a, 


where a is a constant, then we obtain a criterion for a nondegenerate distri- 
bution to belong to 6. 


I.7. There is one particular error which is repeated in fairly many papers 
(connected in one way or another with stable laws). Namely, to describe the 
family G they use form (A) for log g(t) with the sign in front of itw,(t, a, 3) 
chosen to be “minus” in the case a # 1. Along with this it is commonly 
assumed that the value 8 = 1 corresponds to the stable laws appearing in the 
scheme (I.6) as limit distributions of the normalized sums Zn With positive 
terms. But this assumption is incompatible with the choice of “minus” in 
front of itw4 in the form (A). 

The error evidently became widespread because it found its way into the 
well-known monograph [26]. Hall [29] devoted a special note to a discussion of 
this error, calling it (with what seems unnecessary pretentiousness) a “comedy 
of errors”. 

Though he is undoubtedly right on the whole, in our view he exaggerates 
unnecessarily, presenting the matter as if the mistake he observed is almost 
universal. In reality this defect in [26] was noticed long ago. For example, 
special remarks on this were made in the papers [95] of Zolotarev and [79] 
of Skorokhod. And in general there are more than a few papers and books 
whose authors were sufficiently attentive and did not lapse into this “sin”. 
For example, we can mention Linnik’s paper [53] and Feller’s book [22]. 


Chapter 1 


1.1. In the late 1950s and early 1960s a probability seminar met at Moscow 
State University. At one session the leader, E. B. Dynkin, related the following 
curious problem, which I reproduce here as I remember it. 

Consider in the (z, y)-plane a Brownian motion of a particle beginning at 
: (0,0) and subject to a constant drift in the direction of the vector a = (a, a2) 
with ay > 0 and a2 < 0. The line y = —1 is an absorbing barrier. For 
such a drift the particle is absorbed with probability 1. Let (Xo,—1) be the 
absorption point. It turns out that in the case when a; > 0 and a2 < 0 the 
random variable Xo has a normal distribution on the line y = —1, while in 
the case when a; > 0 and az = 0 it has a stable distribution with parameters 
a = 1 and 6 = 1 (here the parameters can be associated both with the form 
(A) and with the form (B)). But if ay = 0 and a» = 0, then Xo has a Cauchy 
distribution (i.e., a = 1 and 8 = y = 0). 
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1.2. A model of point sources of influence based on the use of a Poisson 
distribution for the number of points in a finite region is visible as far back 
as the short note of Lifshits [117], which dealt with the solution of a certain 
problem in physics (it is discussed in more detail in the second example). 
Somewhat later a particular case of this model became the subject of an 
independent investigation in Good’s paper [112]. It is possible that interest in 
the model itself was stimulated by the paper of Holtsmark [114] (see Example 
1) or by a result of that paper in Chandrasekhar’s book [103]. 

But neither Holtsmark nor Chandrasekhar relied on a model in which the 
number of particles falling in a bounded volume is random and has a Poisson 
distribution. As a result their method looks more unwieldy and less universal. 

A particular case of the model of point sources of influence lay at the 
basis of the solution of a certain problem in solid-state theory considered by 
Zolotarev and Strunin in [141]. 


1.3. The Holtsmark distribution has become widely known in probability 
theory because of the books of Chandrasekhar [103] and Feller [22]. There does 
not exist an explicit expression for the distribution density of v in elementary 
functions. However, a relatively simple expression is known (see [168]) for the 
distribution function of the length of v: 


nm /2 
P(\vA-2/3| < r) =1- = | (1+ 3ar°) exp(—ar?)dy, r>(, 
0 


where a = (cos p)? cos(p/2)(sin(3p/2))~3. 


1.4. The example is taken from an article of Ovseevich and Yaglom [132]. 
This contains an error which essentially affects the content of the whole paper. 
After observing that the time function F(t, A) of the types of lines considered 
can be represented as a composition of any number n of identical time func- 
tions, F(t, A) = F(t,A/n) *--- * F(t, A/n), the authors conclude that F(t, A) 
must be the distribution function of a stable law. However, the property they 
noticed actually characterizes a broader class of distributions (see Comment 
Liz} 


1.5. Only after the manuscript was prepared for printing did I become 
aware of an interesting instance of stable laws in the hereditary theory of 
elasticity. It is contained in the book The mathematical theory of wave propa- 
gation in media with a memory by A. A. Lokshin and Yu. V. Suvorov (Moscow 
State University Press, Moscow, 1982). 

The general equation of motion of a homogeneous hereditary-elastic rod 
of unit density performing longitudinal oscillations under the action of an 
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external load f(t, xz) has the form 


g g? p? 
Vu= f, where V = a ga RO y 


and where t is the time, z is the coordinate of a point of the rod, and R(t) is 
the so-called relaxation kernel, which is connected with another characteristic 
of the rod—the creep kernel K (t)—by the relation 


R(t) = K(t) — K(t)* K(t) +. 


(D) 


In the case when the creep kernel has the form K = p + T * p on the 
interval 0 < t < to, where p(t) = ht-°E(t)/T(1—a), h > 0 0< a i, 
and E(t) is the unit distribution, the fundamenta] solution €(t,x) of (D) is 
nonzero in the angle t > |z| and can be written with the help of the functions 
G(x, a, 1) in the region |z| < t < |z| + to: 


t= £ (( — z) / (512 ar 7 


f ae (o -kD / a) “a 7 dy. 


Stable laws appear in other problems in the theory of hereditary elasticity in 
connection with the fact that the creep kernel K (t) = 3—a(—b, t) of Rabotnov, 
which is popular in this theory, is related to the Mittag-Leffler function E, (zx) : 


alb, t) = (1 + a)t? E; „(bt!+9), 


Very recently I became acquainted with the paper “Stable measures and 
processes in statistical physics” by A. Weron and K. Weron, and with the 
survey paper “Stable processes and measures: A survey” by A. Weron, both 
published as preprints in the Center for Stochastic Processes at the University 
of North Carolina.* These papers contain a number of interesting examples 
of the appearance of stable processes in problems in contemporary theoretical 

physics. 


Chapter 2 


2.1. The reference to material in Chapter 3 is given only for the purpose 
of a general orientation in evaluating the possibilities of using Lemma 2.1.1, 
presented below. 

* Translator’s note. The second of these papers has since been published in Probability 


Theory on Vector Spaces. III (Proceedings, Lublin, 1983), Lecture Notes in Math., vol. 
1080, Springer-Verlag, 1984, pp. 306-364. 
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2.2. There are several cases in which the densities of stable laws can be 
expressed by means of special functions. Corresponding formulas are given in 
82.8. 


2.3. The idea of using the representations (2.2.8)-(2.2.10) in analyzing 
the properties of stable distributions stems from Linnik [53], who considered 
formally only the case a < 1. The equality (2.2.11) was given in the author’s 
paper [97]. 


2.4. The representation (2.2.13) was obtained by Skorokhod [79]. 


2.5. Theorem 2.2.3 was proved in a somewhat modified form in the au- 
thor’s paper [98]. 


2.6. The fact that the stable distributions with a < 1, B= 1, and y =0 
are entirely concentrated on the semi-axis z > 0 was observed independently 
by several authors: Bergstrom [2], Linnik [53], and Ovseevich and Yaglom 
[132]. However, Lévy ((113) in [50]) was the first to direct attention to it. 


2.7. Theorem 2.3.1 (more precisely, the equality (2.3.3) corresponding to 
the form (A)) is the main result in the author’s article [95]. 


2.8. In [78] and [80] Skorokhod proved an assertion equivalent to the as- 
sertion of Theorem 2.4.1 in the part corresponding to the cases a < 1 and 
œ= 1. The fact that the density g(x, a, 3) is an entire analytic function when 
œ > 1 was mentioned in the book [26] of Gnedenko and Kolmogorov, with a 
reference to A. I. Lapin. 


2.9. The representations (2.4.6) and (2.4.8) were found independently by 
Bergstrom [2], Feller [21], and Chao Chung-Jeh [9]. These series expansions 
for the densities g(z,a,0) were also obtained by Wintner [89]. For the case 
3 = 1 the equality (2.4.8) is mentioned in a paper of Pollard [67]. In the cited 
paper Bergstrom refers to a paper of Humbert in 1945, where the formal 
expansion (2.4.8) is given for the function g(z,a,1) with a < 1. 


2.10. After the power series expansions of the entire functions q(x, a, b) 
with a Æ 1 were found, computation of their order ø and type 6 was not 
difficult, so the values of o and 6 were known to anyone interested in the 
properties of stable distributions. Theorem 2.4.3 first appeared in the book 
[54] of Lukacs in a somewhat modified formulation. However, the proof there 
of the fact that ø = co when a = 1 and 8 Æ 0 is erroneous. Our version of the 
proof was suggested by Ostrovskii, who also pointed out the precise equality 

lim sup r+ log log M(r) = 1/8. 


T= OO 
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2.11. N. V. Smirnov knew of the possibility of expanding the function 
q(z,a@,1), a <1, in Laguerre polynomials. I had occasion to hear about this 
at a seminar of Kolmogorov (who was working at Moscow State University 
during 1954-1955). 


2.12. Kolmogorov was the first to point out the asymptotic relation 
(2.5.18), without a rigorous proof, at the seminar mentioned in Comment 
2.11. The first terms in the asymptotic expansion (2.5.17) were obtained in 
the case a < 1 by Linnik [53] and in the case a 2 1 by Skorokhod [79]. The 
asymptotic expansion (2.5.17) appeared in full scope, in a somewhat modified 
form, in the book [35] of Ibragimov and Linnik. It should be mentioned that 
the formulas presented there contain misprints. 


2.13. The asymptotic expansion (2.5.25) was found by Skorokhod [79] 
(without explicit expressions for the coefficients Sys) 


2.14. The equality (2.6.10) was obtained in the case a < 1 by Pollard [67], 
and in the case a@ > 1 by the author [97]. 


2.15. The equality (2.6.11) was proved by the author in [97]. 
2.16. The relation (2.6.15) is equivalent to (3.3.10). 
2.17. Theorem 2.6.3 was proved by the author [97]. 
2.18. The equality (2.6.26) was proved by the author in [97]. 


2.19. The class L is a subset of 6. It can be defined as the set of all 
possible complete limits* of distributions of increasing sums of independent 
random variables (X; + --.+ Xn)B3 1 — A, with the property that 


max(Xı,..., Xn)B7! $0 as n > oœ, 


or as the set of infinitely divisible distributions with absolutely continuous 
spectral functions H such that the product £zH’(x) is nonincreasing on the 
semi-axes z < 0 and z > 0. 


2.20. Theorem 2.7.2 is due to Khintchine [44]. 


2.21. Theorem 2.7.3 is a probabilistic interpretation of a known unimodal- 
ity criterion (given in Corollary 2) of Khintchine. An equivalent statement in 
terms of the characteristic transforms (2.7.5) was presented by the author in 
[101]. It is mentioned in Feller’s book [22] that Shepp also pointed out this 
interpretation. 


“We have in mind “complete” convergence. 
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2.22. Lemma 2.7.2 was used by Wintner to prove Theorem 2.7.4 in [86]. 
We present a new proof of this lemma. 


2.23. Theorem 2.7.5 was proved by Ibragimov and Chernin [34]. 


2.24. The system of operators used by us, which can be interpreted as frac- 
tional integration and fractional differentiation, is not unique. For example, 
we can use a system of operators of the form (0 < r < 1) 


Tha) = a. (x — t)’~*h(t) dt, 


ila) = Trees |_to =h t) T dt. 


There are also other types of operators with analogous properties. For further 
information we refer the reader to Feller’s paper [21]. Wolfe’s paper [92] is 
one of the few articles which uses fractional differentiation and integration 
operators in probability theory problems. 


2.25. Related in structure to (2.8.12) is an integrodifferential equation in 
the book [35] of Ibragimov and Linnik (Theorem 2.3.2) for the densities of 
stable laws; however, the equation contains an essential error. The error 
arises because the complex factor exp(—irr) is absent in the definition of the 
fractional integration operator (which is analogous to that used by us). As a 
result, the operator inverse to the one defined by them cannot be regarded as 
a generalization of the concept of differentiation. 

In the same place it is asserted that an integrodifferential equation of the 
form (2.8.12) can be transformed into the differential equation (2.8.25) in the 
case when a is rational. Actually, this equation is obtained as a consequence 
of an integrodifferential equation of the form (2.8.20). 


2.26. In considering the densities g(x, a, 3,0, À) of unbiased strictly stable 
laws, Medgyessy [62] derived for them a partial differential equation in the 
case when a = m/n, a rational number with relatively prime m and n: 


3 0% +b; 
2 K; 023 Os g=0, 


where aj, bj, and K; are constant numbers dependent on m, n, G, and a certain 
free parameter whose variation gives a family of equations. If we take account 
of the equality 

j= \-1/%9(rA—1/%, a, 8,0, tj, 


then the connection between this equation and (2.8.24) becomes obvious. 


2.27. The equation (2.8.26) is contained in Linnik’s paper [53]. 
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2.28. It is not hard to transform the right-hand side of (2.8.29) to the form 
1 : 
~Re{ vrs exp(—¢”) — 2içw(¢)}, 


where ¢ = —iz/2 and w(¢) = exp(—¢?) fó exp(t?) dt. The function w(¢) is 
tabulated for complex ¢ in [42]. 


2.29. The equation (2.8.33) was established by Pollard [67]. 


2.30. Theorem 2.9.1 was proved by the author [97]. 
The paper [13] of Cressie studies the properties of the distribution 


Ra(z) = P(|Y (a, 8,0, A)® < 2), a< i, 


and also solves problems such as the representation of the density R’ (x) by 
a series in inverse powers of z, and the limit behavior of R,(z) as a > 0 (we 
use our own notation here). It is easily seen that 


Bals) = G(c/, a, B,0,A).— G(z!/*, a, —8,0, A); 


hence both of these problems get an obvious solution if one uses the elementary 
property Y (a, 8,0, A) = \1/*Y (a, 0) and invokes (2.4.8) and the limit relation 
(2.9.1). 


2.31. The asymptotic expansion (2.9.4) was obtained by the author in 
[97]. It is proved in the paper [7] of Brockwell and Brown that the asymptotic 
expansion (2.9.4) is a convergent series for all a < 1 in the case when G=1 
and x > 0. The structure of the asymptotic expansion (2.9.4) shows that its 
convergence for any admissible a < 1 and 9 is a consequence of the result 
mentioned. 


2.32. The asymptotic expansion (2.9.8) was presented in the author’s pa- 
per [97]. However, the expression given there for the function H; is erroneous. 


2.33. Theorem 2.10.1 is a result of the author [96]. 


2.34. The connections (2.10.9) and (2.10.12) between the Mittag-Leffler 
_ functions and stable laws were pointed out by the author in [96] and [97]. 


2.35. In some of his papers Studnev considered a generalization of the 
concept of infinitely divisible distributions within the limits of the set of func- 
tions V(x) of bounded variation that satisfy the conditions V(—oo) = 0 and 
V (co) = 1. The 1967 paper On some generalizations of limit theorems in prob- 
ability theory (Teor. Veroyatnost. i Primenen. 12 (1967), 729-734=Theor. 
Probab. Appl. 12 (1967), 668-672) contains a remark about the class of “gen- 
eralized stable laws” and gives a form for the corresponding Fourier-Stieltjes 
transforms. Although this class contains the Airy function, the question of 
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whether it is a natural extension of the family of stable distributions has not 
been considered so far. For a criterion for “naturalness” we can take, in par- 
ticular, the duality law, which is satisfied by the Airy function, as (2.10.15) 
shows. 


Chapter 3 


3.1. With the exception of Theorem 3.4.3 and the information in §3.5, 
the material in this chapter is based on results of the author in [97] (mainly 
the second part of it). There are several publications of other authors whose 
results are directly or indirectly connected with the facts presented below. 
These cases will be mentioned and commented on separately. 

We note one important detail. 

In the case when the random variable X takes both positive and negative 
values with positive probability, the use of the proposed notation for the 
generalized power can give rise to an ambiguous situation, since, for example, 
X? can be understood as |X|? or as |X|?sgn(X). Therefore, it would be 
more correct to give the generalized power its own notation, say XV = 
|X|*sgn X. We did not do this, in order to achieve a certain simplification 
in the formulas. This will not cause any confusion in this chapter if we agree 
always to understand a power of a normally distributed random variable N 
in the usual sense, i.e., N2* = |N|?*, when k is an integer. 


3.2. The statistical interpretation of the concept of a cutoff is as follows. 
If X,,...,Xn is an independent sample subordinate to the same distribution 
as the random variable X, then by discarding among the X; only those are 
positive, we obtain a sample X;,,..., Xir (of random size T subordinate to 
the binomial distribution) in which the elements have the same distribution 
as a cutoff X. 


3.3. The fact that the random variable 1/X has a Cauchy distribution if X 
itself does has apparently been known for a long time, but the author has not 
been able to find any references prior to his paper [97]. Menon [63] obtained 
the following interesting characterization of distributions of Cauchy type. 
Suppose that the random variables X and 1/X are stable, and 1/X 4 h(X), 
where h(x) = Ar + O(1), A is a constant, and h'(x) = A+ O(|z|~*), € > 0, 
as |z| + oo. Then X has a distribution G.4(z, 1,0, 7, À). 


3.4. The following interesting equality can be derived from (3.3.4) if we 
choose a = 4 and use the explicit expression for the density g(x, 5,1): 


oO 1 a’ 
—su j me | d =a yT —3/2 1 , 
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The equality (3.4.26) is a particular case of it (if we choose a/ = +). 


3.5. The simplest case’of the distribution of the statistic L is obtained 
when a = 2. This follows from (3.3.21) and (3.3.17): 
a Z(2, 1/2) 
- Z(2,1/2) 
i.e, the statistic L, which in this case represents the ratio N/N of two indepen- 
dent random variables distributed according to the standard normal law, has 
a Cauchy distribution, a fact well known in probability theory. The following 
relation (a consequence of (3.3.17)) is analytically related to it: 
Z(a,1/a) a 1 
>—— = 1, — <2 
Z(a, 1/a) aC] ie 
3.6. Theorem 3.4.3 and its corollary are due to Williams [85]. In his proof 
the form of the Mellin transform M (—s,1/n,1) is found independently with 
the help of a clever argument which we regard as useful to present. If œ = 1/n, 
then for any s > 0 


exp(—s*) = Eexp(—sY(a)) = P(sY(a) < E), 


where E is a random variable independent of Y (œ) and having an exponential 
distribution. Since 


U=Z(1,3)U4Y(1,0), 


P((E/Y(a))* > s*) = exp(—s?), 
it follows that 
(E/Y(a))*£E and Y(a)/EŻ E-e, 
Consequently, for any r > =a 
E(Y~"(a)E") = E(E"/*) =T(1+ r/a) 
and, moreover, since Y (œ) and E are independent, 
E(Y~"(a)E") = EY’ (a)E(E’) = EY~"(a)I'(1+7). 
A comparison of the above equalities shows that 
M(-r,a,1) = EY~"(a) =T(1+7r/a)/T(1+ r). 


Chapter 4 


4.1. The material in this chapter is for the most part taken from the au- 
thor’s paper [167]. Theorem 4.3 exploits a technique of Januškevičienė [166], 
where, in particular, an analogous problem is solved. 
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List of Notation (*) 


M, class of M-infinitely divisible 
distributions 

6S, family of SD’s 

W, class of strictly SD’s 

1, class of distributions in W with 
Ac = 1 

6, class of IDL’s 

A = àB, ÀA, a parameter—the time in 
stable processes, and the scale 
parameter for strictly SD’s (in the 
form (B), (A)) 

6 = Bg, BA, a parameter—the degree of 
asymmetry of an SD (in the form 
(B), (A)) 

Yy = YB, YA, the bias characteristic of 
an SD (in the form (B), (A)) for 
fixed A 

K(a) = a — 1 + e(a) = sgn(1 — a), 
functions in certain forms of 
expression for the CF of an SD 

g(t, œ, b, Yy, à) = g(t), the CF of an SD 
with parameter system in the form 
(A) 

a, the main parameter of an SL, the same 
in all the forms except for the form 
(E) 

g(x, œ, B, Y, à), the density of an SD 
corresponding to the form (A) or 
(B) 

g(x, a, 3) = g(x, a, B,0,1), the density of 
a standard SD, in the form (B) 
beginning with §2.2 

G(x, a, 8,7,), DF of an SD 


G(z, a, 8) = G(z, a, B,0, 1), DF of a 
standard SD, in the form (B) 
beginning with §2.2 

0, a parameter—the degree of assymetry 
of an SD in the forms (C) and (E) 

p = (1+ @)/2, a parameter replacing the 
parameter ô in the form (C) 

v = 1/a?, a parameter replacing the 
parameter a in the form (E) 

T, a parameter responsible for scale 
changes of an SD in the form (E) 

Y (a, B,y,A), a random variable with SD 
in the form (B) (sometimes also in 
the form (A), if explicitly 
mentioned) 

Y (a, B) = Y (a, 6,0, 1), a random variable 
with a standard SD, in the form (B) 
beginning with §2.2 

Y (a, 0) = Yc (a, 0), a random variable 
with strictly SD in the form (C) and 
with DF Go(z, a, @) 

Z(a,p), a cutoff of a random variable 
Y (a, 6) 

Y (a), abbreviated notation for a random 
variable Y(a,1) forO<a< 1 

M(s,a,p), the Mellin transform of the 
density of the SD of a variable 
Z(a, p) 

wz (s,a,0), k = 0,1, an element of the 
characteristic transform of the SD 
with density gc (x, a, 0, A) 

E, a random variable with DF 
1 — exp(—z), t> 0 


(*)Abbreviations: IDL = infinitely divisible law; SD (SL) = stable distribution (stable 
law); CF = characteristic function; DF = distribution function. 
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law); CF = characteristic function; DF = distribution function. 
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